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Abstract 

We give an algorithm that for an input n- vertex graph G and integer fc > 0, in time 2^^^in 

^^ I either outputs that the treewidth of G is larger than fc, or gives a tree decomposition of G of 

» . , width at most bk + 4. This is the first algorithm providing a constant factor approximation 

^ i' for treewidth which runs in time single-exponential in k and linear in n. 
•^ , Treewidth based computations are subroutines of numerous algorithms. Our algorithm 

^_ ' can be used to speed up many such algorithms to work in time which is single-exponential 

*vj ■ in the treewidth and linear in the input size. 

^ ■ 1 Introduction 

Q 

t^ ■ Since its invention in the 1980s, the notion of treewidth has come to play a central role in an 

enormous number of fields, ranging from very deep structural theories to highly applied areas. 
An important (but not the only) reason for the impact of the notion is that many graph problems 
that are intractable on general graphs become efficiently solvable when the input is a graph of 
bounded treewidth. In most cases, the first step of an algorithm is to find a tree decomposition 

CN \ of small width and the second step is to perform a dynamic programming procedure on the tree 

decomposition. 

In particular, if a graph on n vertices is given together with a tree decomposition of width /c, 

^^ \ many problems can be solved by dynamic programming in time 2^^^'n, i.e., single-exponential in 

C^ I the treewidth and linear in n. Many of the problems admitting such algorithms have been known 

for over thirty years [B] but new algorithmic techniques on graphs of bounded treewidth [111 
[21] as well as new problems motivated by various applications (just a few of many examples 

S^ I are [U [271 ED EH]) continue to be discovered. While a reasonably good tree decomposition 

H ■ can be derived from the properties of the problem sometimes, in most of the applications, the 

- - - computation of a good tree decomposition is a challenge. Hence the natural question here is what 

can be done when no tree decomposition is given. In other words, is there an algorithm that for 
a given graph G and integer k, in time 2^^^>n either correctly reports that the treewidth of G is 
at least k, or finds an optimal solution to our favorite problem (finds a maximum independent 
set, computes the chromatic number, decides if G is Hamiltonian, etc.)? To answer this question 
it would be sufficient to have an algorithm that in time 2^^^'n either reports correctly that the 
treewidth of G is more that /c, or construct a tree decomposition of width at most ck for some 
constant c. 
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However, the lack of such algorithms has been a bottleneck, both in theory and in practical 
applications of the treewidth concept. The existing approximation algorithms give us the choice 
of running times of the form 2^^^'n?, 2^^^^"^'^'n\ogn, or k^^'^ 'n, see Table[TJ Remarkably, the 
newest of these current record holders is now almost 20 years old. This "newest record holder" 
is the linear time algorithm of Bodlaender [71 [9] that given a graph G, decides if the treewidth 
of G is at most /c, and if so, gives a tree decomposition of width at most k in 0{k^ >n) time. 
The improvement by Perkovic and Reed p6] is only a factor polynomial in k faster (but also, 
if the treewidth is larger than k, it gives a subgraph of treewidth more than k with a tree 
decomposition of width at most fc, leading to an 0{Tn?) algorithm for the fundamental disjoint 
paths problem). Recently, a version running in logarithmic space was found by Elberfeld et 
al. [23], but its running time is not linear. 
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Table 1: Overview of treewidth algorithms. Here k is the treewidth and n is the number of 
vertices of an input graph G. Each of the algorithms outputs in time f{k) ■g{n) a decomposition 
of width given in the Approximation column. 

In this paper, we give the first constant factor approximation algorithm for the treewidth 
graph such that its running time is single exponential in treewidth and linear in the size of the 
input graph. Our main result is the following theorem. 

Theorem I. There exists an algorithm, that given an n-vertex graph G and an integer k, in time 
2^ 'n either outputs that the treewidth of G is larger than k, or constructs a tree decomposition 
of G of width at most 5k + 4. 

Of independent interest are a number of techniques that we use to obtain the result and 
the intermediate result of an algorithm that either tells that the treewidth is larger than k or 
outputs a tree decomposition of width at most 3fc + 4 in time 2'^('^'' nlogn. 



Related results and techniques. The basic shape of our algorithm is along the same lines 
as about all of the treewidth approximation algorithms [31 \IE[ [251 1321 [371 I4UJ . i.e., a specific 
scheme of repeatedly finding separators. If we ask for polynomial time approximation algorithms 
for treewidth, the currently best result is that of [25j that gives in polynomial (but not linear) 
time a tree decomposition of width 0{k ■ -y/log k) where k is the treewidth of the graph. Their 
work also gives a polynomial time approximation algorithm with ratio 0(|V//p) for i7- minor 



^Reed 37 does not state the approximation ratio of his algorithm exphcitly. However, a careful analysis of his 
manuscript show that the algorithm can be implemented to give a tree decomposition of width at most 8k + 0{l). 



free graphs. By Austrin et al. [S], assuming the Smah Set Expansion Conjecture, there is no 
polynomial time approximation algorithm for treewidth with a constant performance ratio. 

An important element in our algorithms is the use of a data structure that allows to perform 
various queries in time 0(c log n) each, for some constant c. This data structure is obtained 
by adding various new techniques to old ideas from the area of dynamic algorithms for graphs 
of bounded treewidth [HI HSl El [HI EH] . 

A central element in the data structure is a tree decomposition of the input graph of bounded 
(but too large) width such that the tree used in the tree decomposition is binary and of logarith- 
mic depth. To obtain this tree decomposition, we combine the following techniques: following 
the scheme of the exact linear time algorithms [9l [36], but replacing the call to the dynamic 
programming algorithm of Bodlaender and Kloks |15j by a recursive call to our algorithm, we 
obtain a tree decomposition of G of width at most lOfc + 9 (or 6/c + 9, in the case of the 
0(cn log n) algorithm of Section HI) Then, we use a result by Bodlaender and Hagerup [13] 
that this tree decomposition can be turned in a tree decomposition with a logarithmic depth 
binary tree in linear time, or more precisely, in O(logn) time and 0{n) operations on an EREW 
PRAM. The cost of this transformation is increasing the width of the decomposition roughly 
three times. The latter result is an application of classic results from parallel computing for 
solving problems on trees, in particular Miller- Reif tree contraction [34^135). 

Using the data structure to "implement" the algorithm of Robertson and Seymour |40j 
already gives an 0(c^n log n) 3-approximation for treewidth (Section |3|). Additional techniques 
are needed to speed this algorithm up. We build a series of algorithms, with running times 
of the forms 0(c n log log n), 0(c re log log log n), .. ., etc. Each algorithm "implements" Reeds 
algorithm [37], but with a different procedure to find balanced separators of the subgraph at 
hand, and stops when the subgraph at hand has size O(logn). In the latter case, we call the 
previous algorithm of the series on this subgraph. 

Finally, to obtain a linear time algorithm, we consider two cases, one case for when n is 
"small" (with respect to k), and one case when n is "large", where we consider n to be small if 
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re < 2 , for some constant cq. 

For small values of re, we apply 0(c^n log' ^ re) algorithm from Section [SJ This will yield a 
linear running time in re since log*^ ' n < k. For larger values of n, we show that the linear 
time algorithms of ^ or [3B] can be implemented in truly linear time, without any overhead 
depending on k. This seemingly surprising result can be roughly obtained as follows. We 
explicitly construct the finite state tree automaton of the dynamic programming algorithm in 
sublinear time before processing the graph, and then view the dynamic programming routine 
as a run of the automaton, where productions are implemented as constant time table lookups. 
Viewing a dynamic programming algorithm on a tree decomposition as a finite state automaton 
traces back to early work by Fellows and Langston [26j, see e.g., also |:2j. Our algorithm assumes 
the RAM model of computation [H] , and the only aspect of the RAM model which is exploited 
by our algorithm is the ability to look up an entry in a table in constant time, independently 
of the size of the table. This capability is crucially used in almost every linear time graph 
algorithm including breadth first search and depth first search. 

Overvie'w of the paper. In Section[2]we give the outline of the main algorithms, focusing on 
explaining main intuitions rather than formal details of the proofs. Some concluding remarks 
and open questions are made in Section [3l Sections HI [5] and [6] give the formal descriptions of the 
main algorithms: first the O(c'^relogre) algorithm, then the series of 0(c^re log*^"-* n) algorithms, 
before describing the 0{c n) algorithm. Each of the algorithms described in these sections use 
queries to a data structure which is described in Section [71 



Notation. We give some basic definitions and notation, used throughout the paper. For 
a G N, the function log*^"^^ n is defined as fohows: log^ ' n = logn, and for a > 1, log^"' n = 
log(log("-^)n). 

For the presentation of our results, it is more convenient when we regard tree decompositions 
as rooted. This yields the following definition of tree decompositions. 

Definition 1.1. A tree decomposition of a graph G = {V,E) is a pair T = {{Bi \ i € I},T = 
(/, F)) where T = (/, F) is a rooted tree, and {Bi | i G /} is a family of subsets of V, such that 

• for all {v, w} € E, there exists an i E / with v,w S Bi, and 

• ior all V £ V , ly = {i £ I \ V £ Bi} induces a subtree of T. 

The width oiT = {{Bi \ i € I},T = {I,F)), denoted w(7~) is maxjg/ \Bi\ — 1. The treewidth of 
a graph G, denoted by tw(G), is the minimum width of a tree decomposition of G. 

For each v £ V, the tree induced by /„ is denoted by T^,; the root of this tree, i.e., the node 
in the tree with smallest distance to the root of T is denoted by r^. 

The sets Bi are called the bags of the decomposition. For each node i £ I, we define Vi to be 
the union of bags contained in the subtree of T rooted in i, including B^. Moreover, we denote 
Wi = Vi \ Bi and Gi = G[Vi\, Hi = G[Wi]. Note that by the definition of tree decomposition, 
Bi separates Wi from V \Vi. 

2 Proof outline 

Our algorithm builds on the constant factor approximation algorithm for treewidth described 
in Graph Minors XIII [ID] with running time 0{c^v?). We start with a brief explanation of a 
variant of this algorithm. 

2.1 The 0{2)^^Tn?) time 4- approximation algorithm from Graph Minors XIII. 

The engine behind the algorithm is a lemma that states that graphs of treewidth k have balanced 
separators of size k + 1. In particular, for any way to assign non-negative weights to the vertices 
there exists a set X of size at most k + 1 such that the total weight of any connected component 
of G \ X is at most half of the total weight of G. We will use the variant of the lemma where 
some vertices have weight 1 and some have weight 0. 

Lemma 2.1 (Graph Minors II t39j). //tw(G) < k and S Q V{G), then there exists X C V{G) 
with \X\ < A: + 1 such that every component ofG\X has at most 2 15*1 vertices which are in S. 

We note that the original version of [39J is seemingly stronger: it gives bound 2 l-^yxl instead 
of 2\^\- However, we do not need this stronger version and we find it more convenient to work 
with the weaker. The set X with properties ensured by Lemma 12.11 will be called a balanced 
S -separator, or a -^-balanced S -separator. More generally, for an f3 -balanced S -separator X 
every connected component of G \ X contains at most /3\S\ vertices of S. If we omit the set S, 
i.e., talk about separators instead of S'-separators, we mean S = V{G) and balanced separators 
of the whole vertex set. 

The proof of Lemma 12.11 is not too hard; start with a tree decomposition of G with width 
at most k and orient every edge of the decomposition tree towards the side which contains the 
larger part of the set S. Two edges of the decomposition can not point "in diff'erent directions" , 



since then there would be disjoint parts of the tree, both containing more than half of S. Thus 
there has to be a node in the decomposition tree such that all edges of the decomposition are 
oriented towards it. The bag of the decomposition corresponding to this node is exactly the 
set X of at most k + 1 vertices whose deletion leaves connected components with at most ^|S| 
vertices of S each. 

The proof of Lemma 12.11 is constructive if one has access to a tree decomposition of G of 
width less than k. The algorithm does not have such a decomposition at hand, after all we are 
trying to compute a decomposition of G of small width. Thus we have to settle for the following 
algorithmic variant of lemma [39J . 



Lemma 2.2 ([40]). There is an algorithm that given a graph G, a set S and a fc G N either 
concludes that tw(G') > k or outputs a set X of size at most k + 1 such that every component 
of G\X has at most ^\S\ vertices which are in S and runs in time 0{3'''^'k^^^\n + m)). 

Proof sketch. By Lemma[2T] there exists a set X' of size at most k+1 such that every component 
oi G\ X' has at most 2 1*?! vertices which are in S. A simple packing argument shows that the 
components can be assigned to left or right such that at most glS"! vertices of S go left and at 
most 1 1 5*1 go right. Let Sx he S H X' and let Sl and Sr be the vertices of S that were put 
left and right respectively. By trying all partitions of S in three parts the algorithm correctly 
guesses Sx, Sl and Sr. Now X separates Sl from Sr and so the minimum vertex cut between 
Sl and Sr in G\ Sx is at most \X \ Sx\ < (^ + 1) ~ I'S'xl- The algorithm finds using max- flow 
a set Z of size at most {k + 1) — \Sx\ that separates Sl from Sr in G\ Sx- Since we are only 
interested in a set Z of size at most k — \Sx\ one can run max- flow in time 0{{n + m)k'-^^^'). 
Having found Sl, Sr, Sx and Z the algorithm sets X = Sx U Z, L to contain all components 
oi G\X that contain vertices of Sl and R to contain all other vertices. Since every component 
C of G \ X is fully contained in L or R, the bound on |C n Sj follows. 

If no partition of S into Sl, Sr, Sx yielded a cutset Z of size < (A: + 1) — \Sx\, this means 
that tw(G) > k, which the algorithm reports. D 

The algorithm takes as input G, k and a set S on at most 3k+3 vertices, and either concludes 
that the treewidth of G is larger than k or finds a tree decomposition of width at most 4/c + 3 
such that the top bag of the decomposition contains S. 

On input G, S, k the algorithm starts by ensuring that \S\ = 3k + 3. If IS*! < 3A; + 3 
the algorithm just adds arbitrary vertices to S until equality is obtained. Then the algorithm 
applies Lemma [2. 2 1 and finds a set X of size at most k + 1 such that each component Cj oi G\X 
satisfies |Ci n S] < ^ < 2A; + 2. Thus for each d we have \{S n Q) LlX\<3k + 3. For each 
component Gi oi G \ X the algorithm runs itself recursively on {G[Gi U X],{S Ci Gi) U X,k). 

If either of the recursive calls returns that the treewidth is more than k then the treewidth 
of G is more than k as well. Otherwise we have for every component Gi a tree decomposition 
of G[Gi U X] of width at most 4/c + 3 such that the top bag contains {S n Gi) U X. To make a 
tree decomposition of G we make a new root node with bag Xyj S, and connect this bag to the 
roots of the tree decompositions of G[Gi U X] for each component Cj. It is easy to verify that 
this is indeed a tree decomposition of G. The top bag contains S, and the size of the top bag is 
at most |5| -|- |X| < 4A: + 4, and so the width if the decomposition is at most 4A; + 3 as claimed. 

The running time of the algorithm is governed by the recurrence 

T{n,k) <0(3l^lA:^W(n + m)) + ^r(|C,UX|,A;) (1) 

Ci 

which solves to T{n, k) < {3^^k'^^^'n{n-irm)) since l^l = 3A; + 3 and there always are at least two 
non-empty components oi G\X. Finally, we use the following observation about the number 
of edges in a graph of treewidth k. 



Lemma 2.3 ([I2])- Let G = {V,E) he a graph with treewidth at most k. Then \E\ < \V\k. 

Thus if \E\ > nk the algorithm can safely output that tw(G) > k. After this, running the 
algorithm above takes time 0{3^''k^^^^n{n + m)) = 0{3^'^k^^^^n'^). 

2.2 The 0{k'^^''^n\ogn) time approximation algorithm of Reed. 

Reed [37J observed that the running time algorithm of Robertson and Seymour [50] can be sped 
up from O(n^) for fixed k to 0(n log n) for fixed k, at the cost of a worse (but still constant) 
approximation ratio, and a /c'^'*^) dependence on k in the running time, rather than the 3^^ 
factor in the algorithm of Robertson and Seymour. We remark here that Reed [37] never 
states explicitly the dependence on k of his algorithm, but a careful analysis shows that this 
dependence is in fact of order k^^^' . The main idea of this algorithm is that the recurrence in 
Equation [1] only solves to O(n^) for fixed k if one of the components of G \ X contains almost 
all of the vertices of G. If one could ensure that each component Ci oi G\X had at most c • n 
vertices for some fixed c < 1, the recurrence in Equation [1] solves to 0(n log n) for fixed k. To 
see that this is true we simply consider the recursion tree. The total amount of work done at 
any level of the recursion tree is 0{n) for a fixed k. Since the size of the components considered 
at one level is always a constant factor smaller than the size of the components considered in 
the previous level, the number of levels is only O(logn) and we have 0(n log n) work in total. 
By using Lemma |2 . 1 1 with S = V{G) we see that if G has treewidth < A;, then there is a set 
X of size at most k + 1 such that each component oi G\X has size at most ^. Unfortunately 
if we try to apply Lemma 12.21 to find an X which splits G in a balanced way using S = V{G), 
the algorithm of Lemma 12. 2l takes time 0{?>\^\k^^^> [n + m)) = 0{2>^nP^^>), which is exponential 
in n. Reed j^STj gave an algorithmic variant of Lemma [2T] especially tailored for the case where 
S = V{G). 

Lemma 2.4 ([37]). There is an algorithm that given G and k, runs in time 0{k^^^'n) and 
either concludes that tw(G) > k or outputs a set X of size at most k + 1 such that that every 
component ofG\X has at most ^\S\ vertices which are in S. 

Let us remark that Lemma [2.4l as stated here is never explicitly proved in [37], but it follows 
easily from the arguments given there. 

Having Lemmata 12.21 and 12.41 at hand, we show how to obtain an 8-approximation of 
treewidth in time O(fc'^'-'^-'nlogn). The algorithm takes as input G, k and a set S on at most 
6A; + 6 vertices, and either concludes that the treewidth of G is at least k, or finds a tree 
decomposition of width at most 8k + 7 such that the top bag of the decomposition contains S. 

On input G, S, k the algorithm starts by ensuring that |5| =6^ + 6. If l^l < 6k + 6 
the algorithm just adds vertices to S until equality is obtained. Then the algorithm applies 
Lemma 12.21 and finds a set Xi of size at most k + 1 such that each component C, of G \ Xi 
satisfies |Cj n S"! < ■g|<S'| < 4A; + 4. Now the algorithm applies Lemma 12.41 and finds a set X2 of 
size at most k + 1 such that each component Ci of G\ X2 satisfies |Gj| < ||1^(G)| < |n. Set 
X = Xi U X2. For each component Ci of G \ S we have that | (S n Ci) L) X\ < 6fc + 6. For each 
component Ci of G \ X the algorithm runs itself recursively on {G[Ci U X],{S Ci Ci) U X, k). 

If either of the recursive calls returns that the treewidth is more than k then the treewidth 
of G is more than k as well. Otherwise we have for every component Gj a tree decomposition 
of G[Ci U X] of width at most 8k + 7 such that the top bag contains {S n Ci) U X. Similarly 
as before, to make a tree decomposition of G we make a new root node with bag X U S, and 
connect this bag to the roots of the tree decompositions of G[Ci U X] for each component Gj. 
It is easy to verify that this is indeed a tree decomposition of G. The top bag contains S, and 



the size of the top bag is at most IS"] + |X| < IS"] + |Xi| + IX2I < 6A; + 6 + 2A; + 2 = 8fc + 8, and 
the width of the decomposition is at most 8A; + 7 as claimed. 

The running time of the algorithm is governed by the recurrence 

T{n,k) <0('fc°W(n + m)) +^T(|QuX|,fc) (2) 

Ci 

which solves to T{n, k) < 0{k^^^'{n+m) log n) since each Ci has size at most |n. By Lemma F2.3l 
we have m < kn and so the running time of the algorithm is upper bounded by 0{k^ 'n log n). 

2.3 A new 0{c^n\ogn) time 3-approximation algorithm 

The goal of this section is to sketch a proof of the following theorem. A full proof of Theorem HIl 
can be found in Section HI 

Theorem II. There exists an algorithm which given a graph G and an integer k, either computes 
a tree decomposition of G of width at most 3/c + 4 or correctly concludes that tw(G) > k, in 
time 0{c^ ■ nlogn) for some c G N. 

The algorithm employs the same recursive compression scheme which is used in Bodlaender's 
linear time algorithm [3 [9] and the algorithm of Perkovic and Reed [36] . The idea is to solve 
the problem recursively on a smaller instance, expand the obtained tree decomposition of the 
smaller graph to a "good, but not quite good enough" tree decomposition of the instance in 
question, and then use this tree decomposition to either conclude that tw(G) > fc or find a 
decomposition of G which is good enough. A central concept in this recursive approach of [U] 
is the definition of an improved graph: 

Definition 2.5. Given a graph G = {V, E) and an integer /c, the improved graph of G, denoted 
Gj, is obtained by adding an edge between each pair of vertices with at least k + 1 common 
neighbors of degree at most k in G. 

Intuitively, adding the edges during construction of the improved graph cannot spoil any 
tree decomposition of G of width at most k, as the pairs of vertices connected by the new edges 
will need to be contained together in some bag anyway. This is captured in the following lemma. 

Lemma 2.6. Given a graph G and an integer fc E N, tw(G) < k if and only iftw{Gi) < k. 

If \E\ = 0{kn), which is the case in graphs of treewidth at most k, the improved graph can 
be computed in 0{k^^^' • n) time using radix sort [9j. 

A vertex v G G is /-simplicial, if it is simplicial in the improved graph Gj. The intuition 
behind /-simplicial vertices is as follows: all the neighbors of an /-simplicial vertex must be 
simultaneously contained in some bag of any tree decomposition of Gi of width at most k, so 
we can safely remove such vertices from the improved graph, compute the tree decomposition, 
and reintroduce the removed /-simplicial vertices. The crucial observation is that if no large 
set of /-simplicial vertices can be found, then one can identify a large matching, which can be 
also used for a robust recursion step. The following lemma, which follows from the work of 
Bodlaender [9|, encapsulates all the main ingredients that we will use. 

Lemma 2.7. There is an algorithm working in 0{k^^'n) time that, given a graph G = {V,E) 
and an integer k, either 

(i) returns a maximal matching in G of cardinality at least J,,L , 



\v\ 



or, 



(a) returns a set of at least J.,L I-simplicial vertices 

(in) correctly concludes that the treewidth of G is larger than k. 

Moreover, if a set X of at least J.,L I-simplicial vertices is returned, and the algorithm is 
in addition provided with some tree decomposition 7/ of Gi \ X of width at most k, then in 
0{k'-'^^' ■ n) time one can turn Ti into a tree decomposition T of G of width at most k, or 
conclude that the treewidth of G is larger than k. 

Lemma 12.71 allows us to reduce the problem to a compression variant where we are given a 
graph G, an integer k and a tree decomposition of G of width 0{k), and the goal is to either 
conclude that the treewidth of G is at least k or output a tree decomposition of G of width 
at most 3k + 4. The proof of Theorem [II] has two parts: an algorithm for the compression 
step and an algorithm for the general problem that uses the algorithm for the compression step 
together with Lemma 12.71 as black boxes. We now state the properties of our algorithm for the 
compression step in the following lemma. 

Lemma 2.8. There exists an algorithm which on input G,k, So,Tapx, where (i) Sq C V{G), 
\So\ < 2k + 3, (a) G and G \ Sq are connected, and (Hi) Tapx is a tree decomposition of G of 
width at most 0{k), in 0{c -nlogn) time for some c S N either computes a tree decomposition 
T of G with w(T) < 3A; + 4 and Sq as the root bag, or correctly concludes that tw(G) > k. 

We now give a proof of Theorem HH assuming the correctness of Lemma [2. 81 The correctness 
of the lemma will be argued for in Sections 12.3.11 and 12.3.21 

Proof of Theorem O Our algorithm will in fact solve a slightly more general problem. Here we 
are given a graph G, an integer k and a set Sq on at most 2k + 3 vertices, with the property 
that G \ Sq is connected. The algorithm will either conclude that tw(G) > A; or output a tree 
decomposition of width at most 3k + 4 such that Sq is the root bag. To get a tree decomposition 
of any (possibly disconnected) graph it is sufficient to run this algorithm on each connected 
component with 5*0 = 0. The algorithm proceeds as follows. It first applies Lemma 12.71 on 
{G,3k + 4:). If the algorithm of Lemma 12. 71 concludes that tw(G) > 3A; + 4 the algorithm reports 
that tw(G) >3k + 4> k. 

If the algorithm finds a matching M in G with at least J.,L edges, it contracts every edge 
in M and obtains a graph G' . Since G' is a minor of G we know that tw(G') < tw(G). The 
algorithm runs itself recursively on (G',A;,0), and either concludes that tw(G') > k (implying 
tw(G) > A;) or outputs a tree decomposition of G' of width at most 3k + 4. Uncontracting the 
matching in this tree decomposition yields a tree decomposition TIpx of G of width at most 
6k + 9 [9\. Now we can run the algorithm of Lemma \TE\ on (G, k, 5o, TIpx) and either obtain a 
tree decomposition of G of width at most 3A; + 4 and 5*0 as the root bag, or correctly conclude 
that tw(G) > k. 

If the algorithm finds a set X of at least J.,L I-simplicial vertices, it constructs the improved 
graph G/ and runs itself recursively on (G/ \ X, k, 0). If the algorithm concludes that tw(G/ \ 
X) > k then tw(G/) > k implying tw(G) > k hy Lemma 12.61 Otherwise we obtain a tree 
decomposition of Gj \ X of width at most 3k + 4. We may now apply Lemma 12.71 and obtain 
a tree decomposition TIpx of G with the same width. Note that we can not just output TIpx 
directly, since we can not be sure that Sq is the top bag of TIpx- However we can run the 
algorithm of Lemma [TSl on (G, k, So, TIpx) and either obtain a tree decomposition of G of width 
at most 3A; + 4 and Sq as the root bag, or correctly conclude that tw(G) > k. 

It remains to analyze the running time of the algorithm. Suppose the algorithm takes time 
at most T{n,k) on input {G,k,So) where n = \V{G)\. Running the algorithm of Lemma 12.71 



takes 0{k^^'n) time. Then the algorithm either halts, or calls itself recursively on a graph with 
at most n — qJIg) = n{l — qAst) vertices taking time T(n(l — qt^), k). Then the algorithm 

takes time 0{k'^^^'n) to either conclude that tw(G) > k or to construct a tree decomposition 
TIpx of G of width 0{k). In the latter case we finally run the algorithm of Lemma 12.81 taking 
time 0{c ■ nlogn). This gives the following recurrence: 



T{n,k) < O (c^ -nlogn) + T Tn ("l - — ^"j 



k 



The recurrence leads to a geometric series and solves to T{n, k) < 0{c^k^^^' -n log n), completing 
the proof. For a thorough analysis of the recurrence, see Equations [3] and [H in Section HI 
Pseudocode for the algorithm described here is given in Algorithm [1] in Section HI D 

2.3.1 A compression algorithm 

We now proceed to give a sketch of a proof for a slightly weakened form of Lemma [2.8[ The goal 
is to give an algorithm that given as input a graph G, an integer A;, a set ^o of size at most 6fc + 6 
such that G \ 5o is connected, and a tree decomposition TIpx of G, runs in time ©(c'^n log n) 
and either correctly concludes that tw(G) > k or outputs a tree decomposition of G of width 
at most 8k + 7. The paper does not contain a full proof of this variant of Lemma [2.8l — we will 
discuss the proof of Lemma l2.8l in Section [2.3.21 The aim of this section is to demonstrate that 
the recursive scheme of Section 12.31 together with a nice trick for finding balanced separators is 
already sufficient to obtain a factor 8 approximation for treewidth running in time 0(c nlogn). 
A variant of the trick used in this section for computing balanced separators turns out to be 
useful in our final 0{c n) time 5-approximation algorithm. 

The route we follow here is to apply the algorithm of Reed described in Section 12.21 but 
instead of using Lemma 12.41 to find a set X of size k + 1 such that every connected component 
of G \ X is small, finding X by dynamic programming over the tree decomposition TIpx in time 
0{c n). There are a few technical difficulties with this approach. 

The most serious issue is that, to the best of our knowledge, the only known dynamic 
programming algorithms for balanced separators in graphs of bounded treewidth take time 
0{c n'^) rather than 0[c n): in the state, apart from a partition of the bag, we also need to 
store the cardinalities of the sides which gives us another dimension of size n. We now explain 
how it is possible to overcome this issue. We start by applying the argument in the proof of 
Lemma |2. II on the tree decomposition TIpx and get in time 0(A;'-^'^'n) a partition of V{G) into 
Lq, Xq and Rq such that there are no edges between Lq and Rq, max(|Lo|, |i?o|) < |^ and 
l-'^ol < w(7Ipx) + 1- For every way of writing k + 1 = k^ + kx + kji and every partition of Xq 
into Xl U Xx U Xfi with \Xx\ = kx we do the following. 

First we find in time 0{c^n) using dynamic programming over the tree decomposition TIpx a 
partition oi LqUXq into LiURlUXl such that there are no edges from Ll to Rl, \Xl\ < k^+kx, 
Xx C Xi, Xji C Ri^ and Xl C Ll and the size \Li\ is maximized. Then we find in time 
0{c''n) using dynamic programming over the tree decomposition TIpx a partition of Rq U Xq 
into Lr U Rr U Xr such that there are no edges from Lr to Rr, \Xii\ < kR + kx, Xx C Xr, 
Xr C Rr and Xi C Lr and the size \Rr\ is maximized. Let L = L^ U Lr, R = Rl U Rr and 
X = Xl U Xr. The sets L, X, R form a partition of V{G) with no edges from L to R and 
\X\<kL + kx + kR + kx-kx<k + 1. 

It is possible to show using a combinatorial argument (see Lemma 17.51 in Section [7]) that if 
tw(G) < k then there exists a choice of kL, kx, kR such that k + 1 = kL + kx + kR and partition 
of Xq into Xl U Xx U Xr with \Xx \ = kx such that the above algorithm will output a partition 
of V{G) into X, L and R such that max(|L|, \R\) < ^. Thus we have an algorithm that in 



time 0{c^n) either finds a set X of size at most k + \ such that each connected component of 
G\X has size at most ^ or correctly concludes that tw(G) > k. 

The second problem with the approach is that the algorithm of Reed is an 8-approximation 
algorithm rather than a 3-approximation. Thus, even the sped up version does not quite prove 
Lemma 12.81 It does however yield a version of Lemma 12.81 where the compression algorithm is 
an 8-approximation. In the proof of Theorem |TI] there is nothing special about the number 3 
and so one can use this weaker variant of Lemma 12.81 to give a 8-approximation algorithm for 
treewidth in time 0{c^n\ogn). We will not give complete details of this algorithm, as we will 
shortly describe a proof of Lemma 12.81 using a quite different route. 

It looks difficult to improve the algorithm above to an algorithm with running time 0{c n). 
The main hurdle is the following: both the algorithm of Robertson and Seymour [3(3] and the 
algorithm of Reed [37J find a separator X and proceed recursively on the components of G \ X. 
If we use 0{c^ ■ n) time to find the separator X, then the total running time must be at least 
0{c ■ n ■ d) where d is the depth of the recursion tree of the algorithm. It is easy to see that 
the depth of the tree decomposition output by the algorithms equals (up to constant factors) 
the depth of the recursion tree. However there exist graphs of treewidth k such that no tree 
decomposition of depth o(logn) has width 0{k) (take for example powers of paths). Thus the 
depth of the constructed tree decompositions, and hence the recursion depth of the algorithm 
must be at least ri(logn). 

Even if we somehow managed to reuse computations and find the separator X in time 
0{c • i3^) on average, we would still be in trouble since we need to pass on the list of vertices 
of the connected components of G \ X that we will call the algorithm on recursively. At a first 
glance this has to take 0{n) time and then we are stuck with an algorithm with running time 
0{{c^ ■ 'yp-^ + n) ■ d), where d is the recursion depth of the algorithm. For d = logn this is still 
0{c^n + nlogn) which is slower than what we are aiming at. In Section [2.3.21 we give a proof 
of Lemma [2^ that almost overcomes these issues. 



2.3.2 A better compression algorithm 

We give a sketch of the proof of Lemma 12. 8[ The goal is to give an algorithm that given as 
input a connected graph G, an integer k, a set ^o of size at most 2fc -|- 3 such that G \ 5*0 is 
connected, and a tree decomposition TIpx of G, runs in time 0{c^n\ogn) and either correctly 
concludes that tw(G) > A; or outputs a tree decomposition of G of width at most 3/c -|- 4 with 
top bag 5*0. 

Our strategy is to implement the 0{c n^) time 4-approximation algorithm described in 
Section 12.11 but make some crucial changes in order to (a) make the implementation run in 
0(cn log n) time, and (b) make it a 3-approximation rather than a 4-approximation. We first 
turn to the easier of the two changes, namely making the algorithm a 3-approximation. 

To get an algorithm that satisfies all of the requirements of Lemma 12.81 but runs in time 
O^c^'n^) rather than 0{c^nlogn) we run the algorithm described in Section [2. II setting S = Sq 
in the beginning. Instead of using Lemma 12.21 to find a set X such that every component of 
G\ X has at most ^\S\ vertices which are in S, we apply Lemma l2. II to show the existence of 
an X such that every component of G \ X has at most ^\S\ vertices which are in S, and do 
dynamic programming over the tree decomposition TIpx in time 0{c n) in order to find such 
an X. Going through the analysis of Section [2.11 but with X satisfying that every component 
of G \ X has at most 2 1*?! vertices which are in S shows that the algorithm does in fact output 
a tree decomposition with width 3A; + 4 and top bag Sq whenever tw(G) < k. 

It is somewhat non-trivial to do dynamic programming over the tree decomposition TIpx in 
time 0{c^n) in order to find an X such that every component of G\X has at most llS*! vertices 
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which are in S. The problem is that G\X could potentially have many components and we do 
not have time to store information about each of these components individually. The following 
lemma, whose proof can be found in Section 17.4.21 shows how to deal with this problem. 

Lemma 2.9. Let G be a graph and S C V{G). Then a set X is a balanced S -separator if and 
only if there exists a partition (Mi , M2 , M3) of V{G) \ X , such that there is no edge between Mi 
and Mj for i / j, and \Mi n S\ < \S\/2 for i = 1,2, 3. 

Lemma 12.91 shows that when looking for a balanced /S-separator we can just look for a 
partition of G into four sets X,Mi,M2,M3 such that there is no edge between Mj and Mj for 
i 7^ j, and |Mj n /SI < \S\/2 for i = 1,2, 3. This can easily be done in time 0{c n) by dynamic 
programming over the tree decomposition TIpx- This yields the promised algorithm that satisfies 
all of the requirements of Lemma 12.81 but runs in time 0{c v?) rather than 0(c n log n). 

We now turn to the most difficult part of the proof of Lemma 12.81 namely how to improve 
the running time of the algorithm above from 0{c^n'^) to 0(c^n log n) in a way that gives hope 
of a further improvement to running time 0{c n). The 0(cn log n) time algorithm we describe 
now is based on the following observations; (a) In any recursive call of the algorithm above, 
the considered graph is an induced subgraph of G. Specifically the considered graph is always 
G[G U S] where 5 is a set with at most 2A; + 3 vertices and C is a connected component of 
G\S. (b) The only computationally hard step, finding the balanced 5-separator X, is done 
by dynamic programming over the tree decomposition TIpx of G. The observations (a) and 
(b) give some hope that one can reuse the computations done in the dynamic programming 
when finding a balanced S-separator for G during the computation of balanced S'-separators in 
induced subgraphs of G. This plan can be carried out in a surprisingly clean manner and we 
now give a rough sketch of how it can be done. 

We start by preprocessing the tree decomposition using an algorithm of Bodlaender and 
Hagerup [14J . This algorithm is a parallel algorithm and here we state its sequential form. 

Proposition 2.10 (Bodlaender and Hagerup [14]). There is an algorithm that, given a tree 
decomposition of width k with 0{n) nodes of a graph G, finds a rooted binary tree decomposition 
of G of width at most 3/c + 2 with depth O(logn) in 0{kn) time. 

Proposition 12.101 lets us assume without loss of generality that the tree decomposition TIpx 
has depth O(logn). 

In Section [7] we will describe a data structure with the following properties. The data 
structure takes as input a graph G, an integer k and a tree decomposition TIpx of width 0{k) 
and depth O(logn). After an initialization step which takes 0{c^n) time the data structure 
allows us to do certain operations and queries. At any point of time the data structure is in a 
certain state. The operations allow us to change the state of the data structure. Formally, the 
state of the data structure is a 3-tuple {S,X,F) of subsets of V{G) and a vertex vr called the 
"pin", with the restriction that tt ^ S. The initial state of the data structure is that S = Sq, 
X = F = f/i, and vr is an arbitrary vertex ol G\ Sq. The data structure allows operations that 
change S, X or F by inserting/deleting a specified vertex, and move the pin to a specified vertex 
in time 0{c logn). 

For a fixed state of the data structure, the active component U is the component of G \ 5 
which contains vr. The data structure allows the query findSSeparator which outputs in time 
0(c logn) either an 5-balanced separator X of G[U U S] of size at most A; + 1, or _L, which 
means that tw(G[S' U U]) > k. 

The algorithm of Lemma [2.81 runs the 0{c ri^) time algorithm described above, but uses the 
data structure to find the balanced 5-separator in time O(c'^logn) instead of doing dynamic 
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programming over TIpx- All we need to make sure is that the S in the state of the data structure 
is always equal to the set S for which we want to find the balanced separator, and that the 
active component U is set such that G[U U S] is equal to the induced subgraph we are working 
on. Since we always maintain that \S\ < 2k + 3 we can change the set S to anywhere in the 
graph (and specifically into the correct position) by doing A;*-^'^' operations taking O(c'^logn) 
time each. 

At a glance, it looks like that if we assume the data structure as a black box, this is sufficient 
to obtain the desired 0(cn log n) time algorithm. However, we haven't even used the sets X 
and F in the state of the data structure, or described what they mean! The reason for this is 
of course that there is a complication. In particular, after the balanced S'-separator X is found 
— how can we recurse into the connected components of G[S UU]\{SU X)? We need to move 
the pin into each of these components one at a time, but if we want to use 0(c'^ log n) time 
in each recursion step, we cannot afford to spend 0{\S U U\) time to compute the connected 
components oi G[S U U] \ {S U X) . We resolve this issue by pushing the problem into the data 
structure, and showing that the appropriate queries can be implemented there. This is where 
the sets X and F in the state of the data structure come in. 

The role of X in the data structure is that when queries to the data structure depending on X 
are called, X equals the set X, i.e., the balanced S-separator found by the query findSSeparator. 
The set F is a set of "finished pins" whose intention is the following: when the algorithm calls 
itself recursively on a component U' of G[S U U]\ {S U X) after it is done with computing a 
tree decomposition of G[U' U N(U')] with N{U') as its top bag, it selects an arbitrary vertex of 
U' and inserts it into F. 

The query findNextPin finds a new pin vr' in a component U' of G[S U U]\ (S D X) that 
does not contain any vertices of F. And finally, the query findNeighborhood allows us to find 
the neighborhood N{U'), which in turn allows us to call the algorithm recursively in order to 
find a tree decomposition of G[U' U N{U')] with N{U') as its top bag. 

At this point it is possible to convince oneself that the 0{c''n'^) time algorithm described in 
the beginning of this section can be implemented using 0{k^^') calls to the data structure in 
each recursive step, thus spending only O(c'^logn) time in each recursive step. Pseudocode for 
this algorithm can be found in Algorithm [3l The recurrence bounding the running time of the 
algorithm then becomes 

T{n,k) < 0{c''logn) + ^T{\UiU X\,k). 

Ui 

Here Ui, . . . ,Uq are the connected components of G[S U f7] \ (5" U X). This recurrence solves to 
0(c n log n), proving Lemma 12.81 A full proof of Lemma 12.81 assuming the data structure as a 
black box may be found in Section 14. 2[ 



2.4 The data structure 

We sketch the main ideas in the implementation of the data structure. The goal is to set up 
a data structure that takes as input a graph G, an integer k and a tree decomposition TIpx of 
width 0{k) and depth O(logn), and initializes in time 0{c^n). The state of the data structure 
is a 4-tuple {S, X, F, vr) where S, X and F are vertex sets in G and vr G V{G) \ S. The initial 
state of the data structure is (50,0,0,^) where v is an arbitrary vertex in G \ Sq. The data 
structure should support operations that insert (delete) a single vertex to (from) S, X and F, 
and an operation to change the pin vr to a specified vertex. These operations should run in time 
0{c'' logn). For a given state of the data structure, set U to be the component of G\ S that 
contains tt. The data structure should also support the following queries in time 0{c'' logn). 

12 



• findSSeparator: Assuming that \S\ < 2k + 3, return a set X of size at most k + 1 such 
that every component of G[S U U]\ X contains at most 2l'S'| vertices of S, or conclude 
that tw(G) > k. 

• findNextPin: Return a vertex ir' in a component U' oi G[S UU]\ (S U X) that does not 
contain any vertices of F. 

• findNeighborhood: Return N{U) if \N{U)\ < 2k + 3 and _L otherwise. 

Suppose for now that we want to set up a much simpler data structure. Here the state is just 
the set S and the only query we want to support is findSSeparator which returns a set X such 
that every component of G\{SUX) contains at most ^\S\ vertices of S, or conclude that tw(G) > 
k. At our disposal we have the tree decomposition TIpx of width 0{k) and depth O(logn). To 
set up the data structure we run a standard dynamic programming algorithm for finding X 
given S. Here we use Lemma 12.91 and search for a partition of V{G) into {Mi,M2,M3,X) such 
that \X\ < k + 1, there is no edge between Mj and Mj for i / j, and |Mj n S"] < \S\/2 for 
i = 1, 2, 3. This can be done in time 0{c k^-^'n) and the tables stored at each node of the tree 
decomposition have size 0{c k^^'). This finishes the initialization step of the data structure. 
The initialization step took time 0{c^k^^^>n). 

We will assume without loss of generality that the top bag of the decomposition is empty. 
The data structure will maintain the following invariant: after every change has been performed 
the tables stored at each node of the tree decomposition correspond to a valid execution of the 
dynamic programming algorithm on input (G, S). If we are able to maintain this invariant, then 
answering findSSeparator queries is easy: assuming that each cell of the dynamic programming 
table also stores solution sets (whose size is at most A; + 1) we can just output in time 0{k^^^>) 
the content of the top bag of the decomposition! 

But how to maintain the invariant and support changes in time 0(c'^ log n)? It turns out 
that this is not too difficult: the content of the dynamic programming table of a node t in the 
tree decomposition depends only on S and the dynamic programming tables of t's children. 
Thus, when the dynamic programming table of the node t is changed, this will only affect 
the dynamic programming tables of the O(logn) ancestors of t. If the dynamic program is 
done carefully, one can ensure that adding or removing a vertex to/from S will only affect the 
dynamic programming tables for a single node t in the decomposition, together with all of its 
O(logn) ancestors. Performing the changes amounts to recomputing the dynamic programming 
tables for these nodes, and this takes time 0{c k^^' logn). 

It should now be plausible that the idea above can be extended to work also for the more 
complicated data structure with the more advanced queries. Of course there are several technical 
difficulties, the main one is how to ensure that the computation is done in the connected 
component U oi G\S without having to store "all possible ways the vertices in a bag could be 
connected below the bag" . We omit the details of how this can be done in this outline. The 
full exposition of the data structure can be found in Section [71 

2.5 Approximating treewidth in (9(c'^nlog'"^ n) time. 

We now sketch how the algorithm of the previous section can be sped up, at the cost of increasing 
the approximation ratio from 3 to 5. In particular we give a proof outline for the following 
theorem. 

Theorem III. For every a G N, there exists an algorithm which, given a graph G and an 
integer k, in 0{c^ ■ nlog^"^ n) time for some c G N either computes a tree decomposition of G 
of width at most S/c + 3 or correctly concludes that tw(G) > k. 
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The algorithm of Theorem |TT] satisfies the conditions of Theorem IIIII for a = 1. We will 
show how one can use the algorithm for a = 1 in order to obtain an algorithm for a = 2. In 
particular we aim at an algorithm which given a graph G and an integer k, in 0{c^ ■ n log log n) 
time for some c G N either computes a tree decomposition of G of width at most 5fe + 3 or 
correctly concludes that tw(G) > k. 

We inspect the 0(cn log n) algorithm for the compression step described in Section [2.3.21 
It uses the data structure of Section [27i] in order to find balanced separators in time 0{c^ logn). 
The algorithm uses 0{c logn) time on each recursive call regardless of the size of the induced 
subgraph of G it is currently working on. When the subgraph we work on is big this is very 
fast. However, when we get down to induced subgraphs of size O(loglogn) the algorithm of 
Robertson and Seymour described in Section l2.ll would spend 0(c (log logn) ^) time in each 
recursive call, while our presumably fast algorithm still spends O(c'^logn) time. This suggests 
that there is room for improvement in the recursive calls where the considered subgraph is very 
small compared to n. 

The overall structure of our 0{c log logn) time algorithm is identical to the structure of the 
O(c'^logn) time algorithm of Theorem HIl The only modifications happen in the compression 
step. The compression step is also similar to the 0{c log n) algorithm described in Section r2.3.2l 
but with the following caveat. The data structure query findNextPin finds the largest component 
where a new pin can be placed, returns a vertex from this component, and also returns the size 
of this component. If a call of findNextPin returns that the size of the largest yet unprocessed 
component is less than logn the algorithm does not process this component, nor any of the 
other remaining components in this recursive call. This ensures that the algorithm is never run 
on instances where it is slow. Of course, if we do not process the small components we do not 
find a tree decomposition of them either. A bit of inspection reveals that what the algorithm 
will do is either conclude that tw(G) > fc or find a tree decomposition of an induced subgraph 
of G' of width at most 3/c + 4 such that for each connected component Gt oi G \ V{G'), (a) 
\Ci\ < logn, (b) |A^(Cj)| < 2fc + 3, and (c) N{Ci) is fully contained in some bag of the tree 
decomposition of G'. 

How much time does it take the algorithm to produce this output? Each recursive call 
takes 0(c logn) time and adds a bag to the tree decomposition of G' that contains some 
vertex which was not yet in V{G'). Thus the total time of the algorithm is upper bounded 
by 0(|y(G')| • c logn). What happens if we run this algorithm, then run the 0(c nlogn) 
time algorithm of Theorem HIl on each of the connected components of G \ V{G')? If either of 
the recursive calls return that the treewidth of the component is more than k then tw(G) > k. 
Otherwise we have a tree decomposition of each of the connected components with width 3A; + 4. 
With a little bit of extra care we find tree decompositions of the same width ofGiL)N{Gi) for each 
component Cj, such that the top bag of the decomposition contains N{Gi). Then all of these 
decompositions can be glued together with the decomposition of G' to yield a decomposition of 
width 3A; + 4 for the entire graph G. 

The running time of the above algorithm can be bounded as follows. It takes 0(|y(G")| • 
c log n) time to find the partial tree decomposition of G' , and 

OiJ2c^\Gi\log\Gi\) < O(cMoglogn-^ICil) < ©(c'^n log logn) 

i i 

time to find the tree decompositions of all the small components. Thus, if |y(G')| < 0(j^^) 

the running time of the first part would be 0{c n) and the total running time would be 
0(c'^n log logn). 

How big can |y(G')| be? In other words, if we inspect the algorithm described in Section [2. 11 
how big part of the graph does the algorithm see before all remaining parts have size less than 
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logn? The bad news is that the algorithm could see almost the entire graph. Specifically if 
we run the algorithm on a path it could well be building a tree decomposition of the path by 
moving along the path and only terminating when reaching the vertex which is log n steps away 
from from the endpoint. The good news is that the algorithm of Reed described in Section [2.21 
will get down to components of size logn after decomposing only ©(j^^) vertices of G. The 
reason is that the algorithm of Reed also finds balanced separators of the considered subgraph, 
ensuring that the size of the considered components drop by a constant factor for each step 
down in the recursion tree. 

Thus, if we augment the algorithm that finds the tree decomposition of the subgraph G' such 
that that it also finds balanced separators of the active component and adds them to the top bag 
of the decomposition before going into recursive calls, this will ensure that |y(G')| < 0{y^^) 
and that the total running time of the algorithm described in the paragraphs above will be 
0(cn log logn). The algorithm of Reed described in Section 12.21 has a worse approximation 
ratio than the algorithm of Robertson and Seymour described in Section 12. 1[ The reason is 
that we also need to add the balanced separator to the top bag of the decomposition. When 
we augment the algorithm that finds the tree decomposition of the subgraph G' in a similar 
manner, the approximation ratio also gets worse. If we are careful about how the change is 
implemented we can still achieve an algorithm with running time ©(c'^n log logn) that meets 
the specifications of Theorem IIIII for a = 2. 

The approach to improve the running time from O(c^nlogn) to ©(c'^n log logn) also works 
for improving the running time from 0{c • nlog^^'n) to 0{c • nlog*-"^ ' n). Running the 
algorithm that finds in 0{c n) time the tree decomposition of the subgraph G' such that all 
components of G \ V{G') have size logn and running the 0{c'^ ■ nlog^"^ n) time algorithm on 
each of these components yields an algorithm with running time 0{c ■ nlog^""^ ' n). 

In the above discussion we skipped over the following issue. How can we compute a small 
balanced separator for the active component in time 0{c logn)? It turns out that also this can 
be handled by the data structure. The main idea here is to consider the dynamic programming 
algorithm used in Section [2.3. II to find balanced separators in graphs of bounded treewidth, and 
show that this algorithm can be turned into a 0{c^ logn) time data structure query. We would 
like to remark here that the implementation of the trick from Section 12.3.11 is significantly more 
involved than the other queries: we need to use the approximate tree decomposition not only 
for fast dynamic programming computations, but also to locate the separation {Lq,Xq, Rq) on 
which the trick is employed. A detailed explanation of how this is done can be found at the end 
of Section 17.4.41 This completes the proof sketch of Theorem IIIII A full proof can be found in 
Section [5l 

2.6 5-approximation in 0{c^n) time. 

The algorithm(s) of Theorem [S] are in fact already 0{c^n) algorithms unless n is astronomically 

fc 

large compared to k. If, for example, n < 2^ then log*^ 'n < k and so 0(cn log*^ ' n) < 
0{c^kn). Thus, to get an algorithm which runs in 0{c^n) it is sufficient to consider the cases 
when n is really, really big compared to k. The recursive scheme of Section 12.31 allows us to 
only consider the case where (a) n is really big compared to k and (b) we have at our disposal 
a tree decomposition TIpx of G of width 0{k). 

For this case, consider the dynamic programming algorithm of Bodlaender and Kloks [15] 
that given G and a tree decomposition TIpx of G of width 0{k) either computes a tree de- 
composition of G of width k or concludes that tw(G) > A: in time 0(2*^('^ 'n). The dynamic 
programming algorithm can be turned into a tree automata based algorithm [26t [2] with run- 
ning time 0(2^ -|- n) if one can inspect an arbitrary entry of a table of size 0(2^ ) in 
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constant time. If n > Q,{2 ) then inspecting an arbitrary entry of a table of size 0(2 ), 

means inspecting an arbitrary entry of a table of size 0{n), which one can do in constant time 

in the RAM model. Thus, when n > (7(2^ ) we can find an optimal tree decomposition in 

time 0{n). When n < 0{2^ ) the 0{c''nlog^ ' n) time algorithm of Theorem [S] runs in time 
0{c kn). This concludes the outline of the proof of Theorem HI A full explanation of how to 
handle the case where n is much bigger than k can be found in Section O 

3 Conclusions 

In this paper we have presented an algorithm that gives a constant factor approximation (with 
a factor 5) of the treewidth of a graph, which runs in single exponential time in the treewidth 
and linear in the number of vertices of the input graph. Here we give some consequences of the 
result, possible improvements and open problems. 

3.1 Consequences, corollaries and future work 

A large number of computational results use the following overall scheme: first find a tree 
decomposition of bounded width, and then run a dynamic programming algorithm on it. Many 
of these results use the linear-time exact algorithm of Bodlaender [9| for the first step. If we aim 
for algorithms whose running time dependency on treewidth is single exponential, however, then 
our algorithm is preferable over the exact algorithm of Bodlaender [9j. Indeed, many classical 
problems like Dominating Set and Independent Set are easily solvable in time 0{c^ ■ n) 
when a tree decomposition of width k is provided. Furthermore, there is an on-going work on 
finding new dynamic programming routines with such a running time for problems seemingly 
not admitting so robust solutions; the fundamental examples are Steiner Tree, Traveling 
Salesman and Feedback Vertex Set [Hj. The results of this paper show that for all these 
problems we may also claim 0{c^ ■ n) running time even if the decomposition is not given to us 
explicitly, as we may find its constant factor approximation within the same complexity bound. 

Our algorithm is also compatible with the celebrated Courcelle's theorem [20|, which states 
that every graph problem expressible in monadic second-order logic (MSOL) is solvable in 
time f{k, \\ip\\) ■ n when a tree decomposition of width k is provided, where (p is the formula 
expressing the problem and / is some function. Again, the first step of applying the Courcelle's 
theorem is usually computing the optimum tree-decomposition using the linear-time algorithm 
of Bodlaender [9]. Using the results of this paper, this step can be substituted with finding an 
approximate decomposition in 0{c^ ■ n) time. For many problems, in the overall running time 
analysis we may thus significantly reduce the factor dependent on the treewidth of the graph, 
while keeping the linear dependence on n at the same time. 

It seems that the main novel idea of this paper, namely treating the tree decomposition as a 
data structure on which logarithmic-time queries can be implemented, can be similarly applied 
to all the problems expressible in MSOL. Extending our results in this direction seems like a 
thrilling perspective for future work. 

Concrete examples where the results of this paper can be applied, can be found also within 
the framework of bidimensionality theory |22^ 123] . In all parameterized subexponential algo- 
rithms obtained within this framework, the polynomial dependence of n in the running time 
becomes linear if our algorithm is used. For instance, it follows immediately that every param- 
eterized minor bidimensional problem with parameter k, solvable in time 2^^*'n on graphs of 
treewidth i, is solvable in time 2^^ 'n on graphs excluding some fixed graph as a minor. 
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3.2 Improvements and open problems 

Our result is mainly of theoretical importance due to the large constant c at the base of the 
exponent. One immediate open problem is to obtain a constant factor approximation algorithm 
for treewidth with running time 0{c^n), where c is a small constant. 

Another open problem is to find more efficient exact FPT algorithms for treewidth. Bod- 
laender's algorithm [9] and the version of Reed and Perkovic both use 0{k^^^ >n) time; the 
dominant term being a call to the dynamic programming algorithm of [TJ]. In fact, no exact 
FPT algorithm for treewidth is known whose running time as a function of the parameter k is 
asymptotically smaller than this; testing the treewidth by verifying the forbidden minors can 
be expected to be significantly slower. Thus, it would be very interesting to have an exact 
algorithm for testing if the treewidth of a given graph is at most k in 2°^^ )-nPw time. 

Currently, the best approximation ratio for treewidth for algorithms whose running time is 
polynomial in n and single exponential in the treewidth is the 3-approximation algorithm from 
Section [H What is the best approximation ratio for treewidth that can be obtained in this 
running time? Is it possible to give lower bounds? 

4 An 0{c^n\ogn) S-approximation algorithm for treewidth 

In this section, we provide formal details of the proof of Theorem HIl 

Theorem IV (Theorem HH restated). There exists an algorithm which, given a graph G and 
an integer k, in 0{c^ ■ nlogn) time for some c G N either computes a tree decomposition of G 
of width at most 3fc + 4 or correctly concludes that tw(G) > k. 

In fact, the algorithm that we present, is slightly more general. The main procedure, Alg-^, 
takes as input a connected graph G, an integer k, and a subset of vertices So such that |5o| < 
2k + 3. Moreover, we have a guarantee that not only G is connected, but G \ 5*0 as well. Alg^ 
runs in 0{c ■ nlogn) time for some c G N and either concludes that tw(G) > k, or returns 
a tree decomposition of G of width < 3fc + 4, such that Sq is the root bag. Clearly, to prove 
Theorem [III we can run Alg^ on every connected component of G separately using 5o = 0. 
Note that computation of the connected components takes 0(|1^| + \E\) = 0{kn) time, since if 
\E\ > kn, then we can safely output that tw(G) > k. 

The presented algorithm Alg^^ uses two subroutines. As described in Section [2l Alg^ uses 
the reduction approach developed by the first author [9J; in short words, we can either apply a 
reduction step, or find an approximate tree decomposition of width 0{k) on which a compression 
subroutine Compress^ can be employed. In this compression step we are either able to find 
a refined, compressed tree decomposition of width at most 3A; + 4, or again conclude that 
tw(G) > k. 

The algorithm Compress^ starts by initializing the data structure (see Section [2] for an 
intuitive description of the role of the data structure), and then calls a subroutine FindTD. This 
subroutine resembles the algorithm of Robertson and Seymour (see Section [2|) : it divides the 
graph using balanced separators, recurses on the different connected components, and combines 
the subtrees obtained for the components into the final tree decomposition. 

4.1 The main procedure Alg^ 

Algorithm Alg^ , whose layout is proposed as Algorithm [H runs very similarly to the algorithm 
of the first author [£j; we provide here all the necessary details for the sake of completeness, but 
we refer to [9] for a wider discussion. 
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First, we apply Lemma F2.7l on graph G for parameter 3A;+4. We either immediately conclude 
that tw(G) > S/c + 4, find a set of I-simplicial vertices of size at least -qtt^-, or a matching of 
size at least -qti^ ■ Note that in the application of Lemma 12.71 we ignore the fact that some of 
the vertices are distinguished as Sq. 

If a matching M of size at least -Qpt^ is found, we employ a similar strategy as in [9j. We 
first contract the matching M to obtain G'; note that if G had treewidth at most k then so does 
G' . Then we apply Alg^^ recursively to obtain a tree decomposition T' of G' of width at most 
2>k + 4, and having achieved this we decontract the matching M to obtain a tree decomposition 
T of G of width at most 6/c + 9: every vertex in the contracted graph is replaced by at most 
two vertices before the contraction. Finally, we call the sub-procedure CompresS]^, which given 
G, 5o, k and the decomposition T (of width 0{k)), either concludes that tw(G) > k, or provides 
a tree decomposition of G of width at most 3k + 4, with Sq as the root bag. CompresS]^ is given 
in details in the next section. 

In case of obtaining a large set X of I-simplicial vertices, we proceed similarly as in [9j. We 
compute the improved graph, remove X from it, apply Alg^^ on Gi\X recursively to obtain its 
tree decomposition T' of width at most 3A; + 4, and finally reintroduce the missing vertices of 
X to obtain a tree decomposition T of G of width at most 3A; + 4 (recall that reintroduction 
can fail, and in this case we may conclude that tw(G) > k). Observe that the decomposition 
T satisfies all the needed properties, with the exception that we have not guaranteed that Sq 
is the root bag. However, to find a decomposition that has Sq as the root bag, we may again 
make use of the subroutine Compress^, running it on input G, So, k and the tree decomposition 
T- Lemma 12.71 ensures that all the described steps, apart from the recursive calls to Alg^ and 
CompresS]^, can be performed in 0{k^^' -n) time. Note that the I-simplicial vertices can safely 
be reintroduced since we used Lemma 12.71 for parameter 3/c + 4 instead of k. 

Let us now analyze the running time of the presented algorithm, provided that the running 
time of the subroutine Compress^ is 0{c^ ■ nlogn) for some c G N. Since all the steps of the 
algorithm (except for calls to subroutines) can be performed in 0{k^^' • n) time, the time 
complexity satisfies the following recurrence relation: 

T{n) < 0(fc^(i) •n) + 0(c^'-n log n) +T ( M -—^jnj ; (3) 

Here G is the constant hidden in the 0-notation in Lemma 12.71 By unraveling the recurrence 
into a geometric series, we obtain that 

T(n) < J^f l-—gj 0(A:^W-n + c^- nlogn) (4) 

i=0 ^ ^ 

= Gk^ ■ 0{k^^^^ • n + c^ • nlogn) = 0{c'l -nlogn), 
for some ci > c. 



4.2 Compression 

In this section we provide the details of the implementation of the subroutine Compress^. The 
main goal is encapsulated in the following lemma. 

Lemma 4.1 (Lemma 12.81 restated). There exists an algorithm which on input G,k,SQ,Tapx, 
where (i) Sq C V{G), \So\ < 2k + 3, (ii) G and G \ Sq are connected, and (Hi) Tapx is a tree 
decomposition of G of width at most 0{k), in 0{c^ -nlogn) time for some c G N either computes 
a tree decomposition T of G with w(T) < 3k + 4 and Sq as the root bag, or correctly concludes 
that tw(G) > k. 
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Input: A connected graph G, an integer k, and Sq C V{G) s.t. |5o| < 2k + 3 and G\ Sq 

is connected. 
Output: A tree decomposition T of G with w(7') < 3k + 4 and Sq as the root bag, or 
conclusion that tw(G) > k. 

Run algorithm of Lemma 12.71 for parameter 3A: + 4 

if Conclusion that tw(G) > 3A; + 4 then 
I return _L 

end 

if G has a matching M of cardinality at least ofraT then 
Contract M to obtain G' . 
r'^Algi(G',fc) /*w(r)<3fc + 4*/ 

if r' = -L then 

return _L 
else 

Decontract the edges of M in T' to obtain T- 

return Compress^ (G, A;, T) 
end 
end 

if G has a set X of at least qJIg-s I-siniplicial vertices then 
Compute the improved graph Gj and remove X from it. 
V ^Algi{Gi\X,k) /*w(r')<3fc + 4*/ 

if r' = -L then 
I return _L 

end 

Reintroduce vertices of X to T' to obtain T. 
if Reintroduction failed then 
I return _L 

else 
I return Compress^ (G, A;, T) 

end 
end 

Algorithm 1: Algj^ 

The subroutine's layout is given as Algorithm [2j Shortly speaking, we first initialize the 
data structure P5, with G,k,SQ,T as input, and then run a recursive algorithm FindTD that 
constructs the decomposition itself given access to the data structure. The decomposition is 
returned by a pointer to the root bag. The data structure interface will be explained in the 
following paragraphs, and its implementation is given in Section [71 We refer to Section [2] for a 
brief, intuitive outline. 

The initialization of the data structure takes 0{c^n) time (see Lemma 17. ip . The time 
complexity of FindTD, given in Section [4.31 is 0{c ■ nlogn). 

4.3 The recursive algorithm FindTD 

Subroutine FindTD works on the graph G with two disjoint vertex sets S and U distinguished. 
Intuitively, S is small (of size at most 2k + 3) and represents the root bag of the tree decompo- 
sition under construction. U in turn, stands for the part of the graph to be decomposed below 
the bag containing S, and is always one of the connected components oi G\S. As explained in 
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Input: Connected graph G, k £ N, a set Sq s.t. \So\ < 2k + 3 and G \ 5*0 is connected, 

and a tree decomposition TIpx with w(7Ipx) < 0{k) 
Output: Tree decomposition of G of width at most 3k + 4 with 5*0 as the root bag, or 
conclusion that tw(G) > k. 

Initiahze data structure VS with G, k, S'o, Tlpx 
return FindTD() 

Algorithm 2: Compress^ (G, /c,T). 

Section [2l we cannot afford storing U exphcitly. Instead, we represent U in the data structure 
by an arbitrary vertex vr (called the pin) belonging to it, and implicitly define U to be the 
connected component of G \ 5 that contains tt. Formally, behavior of the subroutine FindTD is 
encapsulated in the following lemma: 

Lemma 4.2. There exists an algorithm that, given access to the data structure T>S in a state 
such that \S\ <2k + 3, computes a tree decomposition T of G[U U S] of width < 3k + 4 with S 
as a root bag, or correctly reports that that tw(G[U U 5]) > A;. // the algorithm is run on S = ^ 
and U = V{G), then its running time is 0{c ■ nlogn) for some c G N. 

The data structure is initialized with S = Sq and tt set to an arbitrary vertex of G \ 5o ; as we 
have assumed that G\Sq is connected, this gives U = V{G) \ Sq after initialization. Therefore, 
Lemma 14.21 immediatelv yields Lemma [ 



A gentle introduction to the data structure Before we proceed to the implementation 
of the subroutine FindTD, we give a quick description of the interface of the data structure VS: 
what kind of queries and updates it supports, and what is the running time of their execution. 
The details of the data structure implementation will be given in Section [71 

The state of the data structure is, in addition to G, A;, T, three subsets of vertices, 5, X and 
F, and the pin vr with the restriction that tt ^ S. S and vr uniquely imply the set U, defined as 
the connected component of G \ 5 that contains vr. The intuition behind these sets and the pin 
is the following: 

• 5" is the set that will serve as a root bag for some subtree, 

• vr is a vertex which indicates the current active component, 

• U is the current active component, the connected component of G \ 5" containing vr, 

• X is a balanced 5-separator (of G[S UU]) and 

• F IS a set of vertices marking the connected components of G[SU U] \ (SUX) as "finished" . 

The construction of the data structure T>S is heavily based on the fact that we are provided 
with some tree decomposition of width 0{k). Given this tree decomposition, the data structure 
can be initialized in 0{c^ ■ n) time for some c G N. At the moment of initialization we set 
S = X = F = (Ji and vr to be an arbitrary vertex of G. During the run of the algorithm, the 
following updates can be performed on the data structure: 

• insert/remove a vertex to/from S, X, or F; 

• mark/unmark a vertex as a pin vr. 
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All of these updates will be performed in 0{c^ • logn) time for some c G N. 

The data structure provides a number of queries that are used in the subroutine FindTD. 
The running time of each query is 0{c'' ■ logn) for some c G N, and in many cases it is actually 
much smaller. We find it more convenient to explain the needed queries while describing the 
algorithm itself. 

Implementation of FindTD The pseudocode of the algorithm FindTD is given as Algorithm[3l 
Its correctness is proven as Claim 14.31 and its time complexity is proven as Claim 14.41 The 
subroutine is provided with the data structure T>S, and the following invariants hold at each 
time the subroutine is called and exited: 

• SQV{G), \S\ <2k + 3, 

• IT exists, is unique and it ^ S, 

• X = F = and 

• the state of the data structure is the same on exit as it was when the function was called. 

The latter means that when we return, be it a tree decomposition or _L, the algorithm that 
called FindTD will have S, X, F and vr as they were before the call. 

We now describe the consecutive steps of the algorithm FindTD; the reader is encouraged to 
follow these steps in the pseudocode, in order to be convinced that all the crucial, potentially 
expensive computations are performed by calls to the data structure. 

First we apply query findSSeparator, which either finds a ^-balanced 5-separator in G[SUU] 
of size at most fc + 1, or concludes that tw(G) > k. The running time of this query is k^^'. 
If no such separator can be found, by Lemma |2. II we infer that tw{G[S U U]) > k and we can 
terminate the procedure. Otherwise we are provided with such a separator sep, which we add 
to X in the data structure. Moreover, for a technical reason, we also add the pin vr to sep (and 
thus also to X), so we end up with having |sep| < k + 2. 

The next step is a loop through the connected components of G[SUU] \ (S'U sep). This part 
is performed using the query findNextPin. Query findNextPin, which runs in constant time, 
either finds an arbitrary vertex u of a connected component of G[SU U] \ {SUX) that does not 
contain any vertex from F, or concludes that each of these components contains some vertex of 
F. After finding u, we mark u by putting it to F and proceed further, until all the components 
are marked. Having achieved this, we have obtained a list pins, containing exactly one vertex 
from each connected component of G[SL) U]\{SU sep). We remove all the vertices on this list 
from F, thus making F again empty. 

It is worth mentioning that the query findNextPin not only returns some vertex n of a 
connected component of G[SD U]\{SL) sep) that does not contain any vertex from F, but also 
provides the size of this component as the second coordinate of the return value. Moreover, the 
components are being found in decreasing order with respect to sizes. In this algorithm we do 
not exploit this property, but it will be crucial for the linear-time algorithm. 

The set X will no longer be used, so we remove all the vertices of sep from X, thus making 
it again empty. On the other hand, we add all the vertices from sep to S. The new set S 
obtained in this manner will constitute the new bag, of size at most \S\ + |sep| < 3k + 5. We 
are left with computing the tree decompositions for the connected components below this bag, 
which are pinpointed by vertices stored in the list pins. 

We iterate through the list pins and process the components one by one. For each vertex 
u G pins, we set u as the new pin by unmarking the old one and marking u. Note that the 
set U gets redefined and now is the connected component containing considered u. First, we 
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find the neighborhood of U in S. This is done using query findNeighborhood, which in 0{k) 
time returns either this neighborhood, or concludes that its cardinality is larger than 2k + 3. 
However, as X was a ^-balanced S'-separator, it follows that this neighborhood will always 
be of size at most 2k + 3 (a formal argument is contained in the proof of correctness). We 
continue with S D N(U) as our new S and recursively call FindTD in order to decompose the 
connected component under consideration, with its neighborhood in S as the root bag of the 
constructed tree decomposition. FindTD either provides a decomposition by returning a pointer 
to its root bag, or concludes that no decomposition can be found. If the latter is the case, we 
inay terminate the algorithm providing a negative answer. 

After all the connected components are processed, we merge the obtained tree decomposi- 
tions. For this, we use the function build(S', X, C) which, given sets of vertices S and X and 
a set of pointers C, constructs two bags B = S and B' = S L) X, makes C the children of B', 
B' the child of B and returns a pointer to B. This pointer may be returned from the whole 
subroutine, after doing a clean-up of the data structure. 

Invariants Now we show that the stated invariants indeed hold. Initially S = X = F = (/> 
and vr E V{G), so clearly the invariants are satisfied. If no S'-separator is found, the algorithm 
returns without changing the data structure and hence the invariants trivially hold in this 
case. Since both X and F are empty or cleared before return or recursing, X = F = 9 holds. 
Furthermore, as S is reset to olds (consult Algorithm [3] for the variable names used) and the 
pin to old^r before returning, it follows that the state of the data structure is reverted upon 
returning. 

The size of S" = is trivially less than 2k + 3 when initialized. Assume that for some 
call to FindTD we have that |olds| < 2k + 3. When recursing, S is the neighborhood of some 
component C of G[olds U C/] \ (olds U sep) (note that we refer to U before resetting the pin). 
This component is contained in some component C" of G[olds U U] \ sep, and all the vertices 
of olds adjacent to C must be contained in C. Since sep is a 2-balanced olds-separator, we 
know that C" contains at most gloldsl vertices of olds. Hence, when recursing we have that 
\S\ < ||olds| + |sep| = i(2fc + 3) + k + 2 = 2k+^ and, since |5| is an integer, it follows that 
\S\ <2k + 3. 

Finally, we argue that the pin vr is never contained in S. When we obtain the elements of 
pins (returned by query findNextPin) we know that X = sep and the data structure guarantees 
that the pins will be from G[olds U C/] \ (olds U sep). When recursing, S = b CI (olds U sep) 
and vr G pins, so it follows that ir ^ S. Assuming vr ^ olds, it follows that vr is not in S 
when returning, and our argument is complete. From here on we will safely assume that the 
invariants indeed hold. 

Correctness 

Claim 4.3. The algorithm FindTD is correct, that is 

(a) iftw{G) < k, FindTD returns a valid tree decomposition of G[SUU] of width at most 3fc + 4 
and 

(h) i/ FindTD returns _L then tw(G) > k. 

Proof. We start by proving (jb]). Suppose the algorithm returns _L. This happens when at some 
point we are unable to find a balanced S'-separator for an induced subgraph G' = G[SUU\. By 
Lemma |2. II the treewidth of G' is more than k. Hence tw(G) > A; as well. 

To show ((aj) we proceed by induction. In the induction we prove that the algorithm creates 
a tree decomposition, and we therefore argue that the necessary conditions are satisfied, namely 
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Data: Data structure DS 

Output: Tree decomposition of width at most 3fc + 4 oi G[S L) U] with 5 as root bag or 
conclusion that tw(G) > k. 

olds ^ I^^.get^O 
old^^P5.get^() 
sep -^ "DiS.findSSeparatorO 
if sep = _L then 
I return _L /* safe to return: the state not changed */ 

end 

P5.insertx(sep) 
P^-insertxlTr) 
pins -^ 
while {u, I) ^ P5.findNextPin() / ± do 

pins. append{u) 

PiS. insert ir(u) 
end 

P5.clearx() 
VS. clear fO 
DS.mseits^sep) 
bags ^ 
for u G pins do 

VS.setTriu) 

haigs.append(T>S.findNeighborhood{)) 
end 

children -^ 
for u,b £ pins, bags do 

VS.setTriu) 

VS. clear s{) 

VS. insert s{b) 

children. append{FlndTB{)) 
end 

P5.clear5() 
P5.insert5'(olds) 
P5.set7r(old7r) 
if _L G children then 
I return _L /* postponed because of rollback of S and tt */ 

end 

return build(olds, sep, children) 

Algorithm 3: FindTD 

• the bags have size at most 3/c + 5, 

• every vertex and every edge is contained in some bag and 

• for each v € V{G) the subtree of bags containing v is connected. 

The base case is at the leaf of the obtained tree decomposition, namely when [/ C S" U sep. 
Then we return a tree decomposition containing two bags, B and B' where B = {S} and 
B' = {S U sep}. Clearly, every edge and every vertex of G[S UU] = G[S U sep] is contained in 
the tree decomposition. Furthermore, since the tree has size two, the connectivity requirement 
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holds and finally, since IS*! < 2k + 3 (invariant) and sep < k + 2 it follows that jSUsepI < 3/c + 5. 

Note that due to the definition of the base case, the algorithm will find no pins and hence it 

will not recurse further. 

The induction step is as follows. Since U '^ S L) sep, the algorithm have found some 

pins vTi, 7r2, . . . , vr^ and the corresponding components Ci, C2, . . . ,Cd in G[S U U]\ {S U sep). 

Let Ni = N{Ci) n (5 U sep). By the induction hypothesis the algorithm gives us valid tree 

decompositions % of G\Ni U Ci]. Note that the root bag of Ti consists of the vertices in Ni. 

By the same argument as for the base case, the two bags B = S and B' = S U sep that we 

construct have appropriate sizes. 

Let V be an arbitrary vertex oi S UU. If v £ SU sep, then it is contained in B'. Otherwise 

there exists a unique i such that v € Ci. It then follows from the induction hypothesis that v 

is contained in some bag of Ti- 
lt remains to show that the edge property and the connectivity property hold. Let uv be 

an arbitrary edge of G[S L) U]. If n and v both are in S" U sep, then the edge is contained in 

B' . Otherwise, assume without loss of generality that u is in some component Q. Then u and 

V are in Ni U Ci and hence they are in some bag of Ti by the induction hypothesis. 

Finally, for the connectivity property, let v be some vertex in S UU. If v ^ S U sep, then 
there is a unique i such that v £ Ci, hence we can apply the induction hypothesis. So assume 
that V € S D sep = B'. Let A be some bag of T containing v. We will complete the proof 
by proving that there is a path of bags containing v from ^ to i?'. If A is B or B' , then this 
follows directly from the construction. Otherwise there exists a unique i such that j4 is a bag in 
T- Observe that v is in Ni as it is in SU sep. By the induction hypothesis the bags containing 

V in T are connected and hence there is a path of bags containing v from A to the root bag 
Ri of T. By construction B' contains v and the bags B' and Ri are adjacent. Hence there is 
a path of bags containing v from A to B' and as A was arbitrary chosen, this proves that the 
bags containing v form a connected subtree of the decomposition. This concludes the proof of 
Claim 1131 □ 



Complexity 

Claim 4.4. The invocation o/FindTD in the algorithm Compress^ runs in 0{c^ ■ nlogn) time 
for some c G N. 

Proof. We will prove the complexity of the algorithm by first arguing that the constructed tree 
decomposition contains at most 2n bags. Then we will partition the used running time between 
the bags, charging each bag with at most 0{c^ • logn) time. It then follows that that FindTD 
runs in 0{c ■ nlogn) time. 

To bound the number of bags we simply observe that at each recursion step, we add the 
previous pin to S and create two bags. Since a vertex can only be added to S one time during 
the entire process, at most 2n bags are created. 

It remains to charge the bags. For a call C to FindTD, let B and B' be as previously and 
let Ri be the root bag of T. We will charge B' and i?i , . . . , it!^ for the time spent on C. Notice 
that as Ri will correspond to B in the next recursion step, each bag will only be charged by 
one call to FindTD. We charge B' with everything in C not executed in the two loops iterating 
through the components, plus with the last call to findNextPin that returned _L. Since every 
update and query in the data structure is executed in 0{c^ ■ log n) time, and there is a constant 
number of queries charged to B', it follows that B' is charged with 0{c'' ■ logn) time. For each 
iteration in one of the loops we consider the corresponding ttj and charge the bag Ri with the 
time spent on this iteration. As all the operations in the loops can be performed in 0{c^ ■ log n) 
time, each Ri is charged with at most 0{c ■ logn) time. Since our tree decomposition has at 

24 



most 2n bags and each is charged with at most 0{c^ ■ logn) time, it fohows that FindTD runs 
in 0{c • nlogn) time and the proof is complete. D 

5 0{c^n\og^^' n) 5-approxiniation algorithm for treewidth 

In this section we provide formal details of the proof of Theorem IIIII 

Theorem V (Theorem IIIII restated). For every a G N, there exists an algorithm which, given 
a graph G and an integer k, in 0{c ■ nlog*^"^ n) time for some c E N either computes a tree 
decomposition of G of width at most bk + A or correctly concludes that tw(G) > k. 

In the proof we give a sequence of algorithms Alg„ for a = 2, 3, . . .; Alg^ has been already 
presented in the previous section. Each Alg^ in fact solves a slightly more general problem than 
stated in Theorem IIIII in the same manner as Alg^^ solved a more general problem than the 
one stated in Theorem [III Namely, every algorithm Alg^ gets as input a connected graph G, an 
integer k and a subset of vertices Sq such that |5o| < 4/0 + 3 and G\So is connected, and either 
concludes that tw(G) > A: or constructs a tree decomposition of width at most 5fe + 4 with So 
as the root bag. The running time of Alg^ is 0{c^ • nlog*^"^ n) for some c € N; hence, in order 
to prove Theorem IIIII we can again apply Alg^ to every connected component of G separately, 
using 5o = 0. 

The algorithms Alg^ are constructed inductively; by that we mean that Alg^ will call 
AlgQ_^, which again will call Alg^_2) ^^^d all the way until Alg^, which was given in the 
previous section. Let us remark that a closer examination of our algorithms in fact shows that 
the constants c in the bases of the exponents of consecutive algorithms can be bounded by some 
universal constant. However, of course the constant factor hidden in the O-notation depends 
on a. 

In the following we present a quick outline of what will be given in this section. For a = 1, 
we refer to the previous section, and for a > 1, Alg^ and Compress^ are described in this 
section, in addition to the subroutine FindPartialTD. 

• Alg^ takes as input a graph G, an integer k and a vertex set Sq with similar assumptions 
as in the previous section, and returns a tree decomposition T of G of width at most 5/c+4 
with Sq as the root bag. The algorithm is almost exactly as Alg^ given as Algorithm [H 
except that it uses Compress^ for the compression step. 

• Compress^ is an advanced version of CompresS]^ (see Algorithm[2D, it allows Sq to be of size 
up to 4A:+3 and gives a tree decomposition of width at most bk+A in time 0(c-n log^"' n). 
It starts by initializing the data structure, and then it calls FindPartialTD, which returns 
a tree decomposition T' of an induced subgraph G' C G. The properties of G' and T' 
are as follows. All the connected components Ci,...Cp of G \ V{G') are of size less 
than logn. Furthermore, for every connected component Gj, the neighborhood N{Gj) 
in G is contained in a bag of 7"'. Intuitively, this ensures that we are able to construct 
a tree decomposition of Gj and attach it to T' without blowing up the width of T' ■ 
More precisely, for every connected component Gj, the algorithm constructs the induced 
subgraph Gj = G[Gj U N{Gj)] and calls Alg^_^ on Gj, k, and N{Gj). The size of N{Gj) 
will be bounded by 4A:+3, making the recursion valid with respect to the invariants of Alg^,. 
If this call returned a tree decomposition Tj with a root bag N{Gj), we can conveniently 
attach Tj to T'; otherwise we conclude that tw(G[Gj U N{Gj)]) > k so tw(G) > A; as well. 

• FindPartialTD differs from FindTD in two ways. First, we use the fact that when enumer- 
ating the components separated by the separator using query findNextPin, these compo- 
nents are identified in the descending order of cardinalities. We continue the construction 
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of partial tree decomposition in the identified components only as long as they are of 
size at least log n, and we terminate the enumeration when we encounter the first smaller 
component. It follows that all the remaining components are smaller then logn; these 
remainders are exactly the components Ci, . . . Cj, that are left not decomposed by Alg^,, 
and on which Alg^^^^ is run. 

The other difference is that the data structure has a new flag, whatsep, which is set to 
either u or s and is alternated between calls. If whatsep = s, we use the same type 
of separator as FindTD did, namely findSSeparator, but if whatsep = u, then we use 
the (new) query findUSeparator. Query findUSeparator, instead of giving a balanced S- 
separator, provides a |-balanced [/-separator, that is, a separator that splits the whole set 
U of vertices to be decomposed in a balanced way. Using the fact that on every second level 
of the decomposition procedure the whole set of available vertices shrinks by a constant 
fraction, we may for example observe that the resulting partial tree decomposition will 
be of logarithmic depth. More importantly, it may be shown that the total number 
of constructed bags is at most 0(n/ log n) and hence we can spend 0{c^ ■ logn) time 
constructing each bag and still obtain running time linear in n. 

In all the algorithms that follow we assume that the cardinality of the edge set is at most 
k times the cardinality of the vertex set, because otherwise we may immediately conclude that 
treewidth of the graph under consideration is larger than k and terminate the algorithm. 

5.1 The main procedure Alg^^ 

The procedure Alg^ works exactly as Alg^^, with the exception that it applies Lemma 12.71 for 
parameter bk + A instead of 3/c + 4, and calls recursively Alg^, and Compress^ instead of Alg^ 
and Compress^. The running time analysis is exactly the same, hence we omit it here. 

5.2 Compression algorithm 

The following lemma explains the behavior of the compression algorithm Compress^,. 

Lemma 5.1. For every integer a > 1 there exists an algorithm, which on input G,k, So,Tapx, 
where (i) Sq C V{G), \So\ < Ak + 3, (ii) G and G \ Sq are connected, and (Hi) Tapx is a tree 
decomposition of G of width at most 0{k), in 0{c • nlog'^^n) time for some c G N either 
computes a tree decomposition T of G with w{T) < 5k + 4 and Sq as the root bag, or correctly 
concludes that tw(G) > k. 

The outline of the algorithm Compress^, for a > 1 is given as Algorithm^ Having initialized 
the data structure using TIpx, the algorithm asks FindPartialTD for a partial tree decomposition 
T', and then the goal is to decompose the remaining small components and attach the resulting 
tree decompositions in appropriate places of T' ■ 

First we traverse T' in linear time and store information on where each vertex appearing in 
T' is forgotten in T' ■ More precisely, we compute a map forgotten : V{G) — )• V{T') U {-L}, 
where for every vertex t; of G we either store _L if it is not contained in T' , or we remember the 
top-most bag Bi of T' such that v & Bi (the connectivity requirement of the tree decomposition 
ensures that such Bi exists and is unique). The map forgotten may be very easily computed 
via a DFS traversal of the tree decomposition: when accessing a child node i from a parent i' , 
we put f orgotten(u) = i for each v £ Bi\ Bi'. Moreover, for every v G Br, where r is the root 
node, we put f orgotten(7;) = r. Clearly, all the vertices not assigned a value in forgotten in 
this manner, are not contained in any bag of 7"', and we put value _L for them. Let W be the 
set of vertices contained in T' , i.e., W = Ujey(r') ^^■ 
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Before we continue, let us show how the map forgotten will be used. Suppose that we have 
some set Y C W, and we have a guarantee that there exists a node i of T' such that Bi contains 
the whole Y. We claim the following: then one of the bags associated with f orgotten(v) for 

V gY contains the whole Y. Indeed, take the path from i to the root of the tree decomposition 
T' , and consider the last node i' of this path whose bag contains the whole Y. It follows that 
i' = forgotten(t;) for some v gY and Y C Bi', so the claim follows. Hence, we can locate the 
bag containing Y in 0{k'^^^' ■ \Y\) time by testing each of |y| candidate nodes forgotten(u) 
for V GY. 

The next step of the algorithm is locating the vertices which has not been accounted for, 
i.e., those assigned _L by forgotten. The reason each of these vertices has not been put into 
the tree decomposition, is precisely because the size of its connected component C of G \ W, is 
smaller than logn. The neighborhood of this component in G is N{C), and this neighborhood 
is guaranteed to be of size at most 4A; + 3 and contained in some bag of T' (a formal proof of 
this fact will be given when presenting the algorithm FindPartialTD, i.e., in Lemma |5.2|) . 

Let Ci, C2, . . . ,Cphe all the connected components of G\W, i.e., the connected components 
outside the obtained partial tree decomposition T' ■ To complete the partial tree decomposition 
into a tree decomposition, for every connected component Cj, we construct a graph Gj = 
G[Cj U N{Cj)] that we then aim to decompose. These graphs may be easily identified and 
constructed in 0{k^^' ■ n) time using depth-first search as follows. 

We iterate through the vertices of G, and for each vertex v such that f orgotten(t;) = _L and 

V was not visited yet, we apply a depth-first search on v to identify its component C . During this 
depth-first search procedure, we terminate searching and return from a recursive call whenever 
we encounter a vertex from W . In this manner we identify the whole component C, and all the 
visited vertices of W constitute exactly N{C). Moreover, the edges traversed while searching 
are exactly those inside G or between G and N{G). To finish the construction of Gj, it remains 
to identify edges between vertices of N{G). Recall that we have a guarantee that N{G) C W 
and N{G) is contained in some bag of T' . Using the map forgotten we can locate some such 
bag in 0{k^^') time, and in 0{k^^') time check which vertices of N{G) are adjacent in it, thus 
finishing the construction of Gj . Observe that during the presented procedure we traverse each 
edge of the graph at most once, and for each of at most n components G we spend 0{k^^') 
time on examination of N{G). It follows that the total running time is 0{k^''^' • n). 

Having constructed Gj, we run the algorithm klg^_i on Gj using Sq = N{Cj). Note that 
in this manner we have that both Gj and Gj \ Sq are connected, which are requirements of the 
algorithm Alg^_^. If Alg^_^ concluded that tw(Gj) > k, then we can consequently answer that 
tw(G) > k since Gj is an induced subgraph of G. On the other hand, if klg^_i provided us 
with a tree decomposition Tj of Gj having N{Gj) as the root bag, then we may simply attach 
this root bag as a child of the bag of T' that contains the whole N{Gj). Any such bag can be 
again located in 0{k^^^') time using the map forgotten. 

5.2.1 Correctness and complexity 

In this section we prove Lemma 1 5. II and Theorem IIIIl and we proceed by induction on a. To 
this end we will assume the correctness of Lemma 15.21 which will be proved later, and which 
describes behavior of the subroutine FindPartialTD(). 

For the base case, a = 1, we use Compress^^ given as Algorithm [5J When its correctness was 
proved we assumed l^ol < 2/c + 3 and this is no longer the case. However, if Alg^^ is applied with 
\Sq\ < 4A: + 3 it will conclude that tw(G) > k ox give a tree decomposition of width at most 
bk + 4. The reason is as follows; Assume that FindTD is applied with the invariant l^l < 4A; + 3 
instead of 2k + 3. By the same argument as in the original proof this invariant will hold, since 



27 



Input: Connected graph G, k £ N, a set Sq s.t. \So\ < 4/c + 3 and G \ 5*0 is connected, 

and a tree decomposition TIpx with w(7Ipx) < 0{k) 
Output: Tree decomposition of G of width at most 5k + 4 with 5*0 as the root bag, or 
conclusion that tw(G) > k. 

Initiahze data structure VS with G, k, 5*0, Tlpx 
V ^ FindPartialTD() 

if r' = -L then 
I return _L 

end 

Create the map forgotten : V{G) — )• V{T') using a DFS traversal of T' 

Construct components Ci, C2, . . . , Cp of G \ W, and graphs Gj = G[Cj U N{Cj)] for 

j = 1,2,... ,p 

for j = 1,2,. . . ,p do 

Tj ^Alg^_i on Gj,k,N{Cj) 
i{ Tj = -L then 
I return _L 

end 

Locate a node i of T' s.t. N{Cj) C Bi, by checking f orgotten(i;) for each 
V G N{Cj) 
Attach the root of Tj as a child of i 



end 
return T' 



Algorithm 4: Compress^ 



2 (4A; + 3) + A: + 2 < 4k -\- 3. The only part of the correctness (and running time analysis) affected 
by this change is the width of the returned decomposition, and when the algorithm adds the 
separator to S it creates a bag of size at most (4A; + 3) + {k + 2) = 5fc + 5 and hence our argument 
for the base case is complete. For the induction step, suppose that the theorem and lemma 
hold for a — 1. We show that Compress^, is correct and runs in 0{c^ ■ nlog^'^' n) time. This 
immediately implies correctness and complexity of Alg^ , in the same manner as in Section HI 

To prove correctness of Compress^,, suppose that T' is a valid tree decomposition for some 
G' (^ G that we have obtained from FindPartialTD. Observe that if T' = -L, then tw(G) > k 
by Lemma [5.21 Otherwise, let Ci,...,Cp be the connected components of G \ W, and let 
Gj = G[Cj U N{Cj)] for j = 1,2,..., p. Let Tj be the tree decompositions obtained from 
application of the algorithm AlgQ,_]^ on graphs Gj. HTj = J- for any j, we infer that tw(Gj) > k 
and, consequently, tw(G) > k. Assume then that for all the components we have indeed obtained 
valid tree decompositions, with N{Cj) as root bags. It can be easily seen that since N{Cj) 
separates Gj from the rest of G, then attaching the root of Tj as a child of any bag containing 
the whole N{Gj) gives a valid tree decomposition; the width of this tree decomposition is the 
maximum of widths of T and Tj, which is at most 5/c+3. Moreover, if we perform this operation 
for all the components Gj , then all the vertices and edges of the graph will be contained in some 
bag of the obtained tree decomposition. 

We now proceed to the time complexity of Compress^. The first thing done by the algorithm 
is the initialization of the data structure and running FindPartialTD to obtain T . Application 
of FindPartialTD takes 0{c^n) time by Lemma 15. 2[ and so does initialization of the data 
structure (see Section [7]). As discussed, creation of the forgotten map and construction of the 
graphs Gj takes 0{k^^^> ■ n) time. 

Now, the algorithm applies klg^_i to each graph Gj. Let Uj be the number of vertices of 
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Gj. Note that 

p p p 

j=i j=i j=i 

Moreover, as Uj < logn + [Ak + 3), it follows from concavity of t — )• log^"~ ' t that 

log(°-i) Uj < log(°-i)(logn + (4/c + 3)) < log(") n + log(""^)(4fc + 3). 

By the induction hypothesis, the time complexity of AlgQ,_]^ on Gj is 0{c^ ■ Uj log^"~ ' Uj), 
hence we spend 0{cr ■ rij log*^"" ' Uj) time for Gj. Attaching each decomposition Tj to T' can 
be done in 0{k^^^') time. 

Let Ga denote the complexity of Compress^ and Aa the complexity of Alg^,. By applying 
the induction hypothesis we analyze the complexity of Compress^ (we use a constant ci > c to 
hide polynomial factors depending on k): 



Ga{n,k) = 0{c^ -n 
= 0{c^ ■ n 

< 0{c^ ■ n 

= 0(4 ■ n 

< 0{c\ ■ n 



p 
+ ^0(c^-njlog("-i)nj) 

p 
p 



c- • nj(log(") n + log("-^)(4A; + 3))) 



+ ^0(c'=-njlog(" 



n] 



i=i 



+ {bk + 3)n • 0{c'' ■ log(°) n) = 0{c'l • nlog^") n). 



We conclude that Compress^ is both correct and that it runs in 0{c^ ■ nlog^'^' n) time 
for some c G N. The correctness and time complexity 0(cn log'"^ n) of Alg^ follow in the 
same manner as in the previous section. And hence our induction step is complete and the 
correctness of Lemma 12.81 and Theorem IIIII follows. The only assumption we made was that of 
the correctness of Lemma 15.21 which will be given immediately. 

5.3 The algorithm FindPartialTD 

The following lemma describes behavior of the subroutine FindPartialTD. 

Lemma 5.2. There exists an algorithm that, given data structure VS in a state such that 
\S\ < 4A; + 3 i/ whatsep = s or \S\ < 3A; + 2 i/ whatsep = u, in time 0{c n) either concludes 
that tw(G[U U S]) > k, or give a tree decomposition T' of G' Q G[U U S] such that 

• the width of the decomposition is at most 5k + 4 and S is its root bag; 

• for every connected component C of G[U U S]\V {G') , the size of the component is less than 
log n, its neighborhood is of size at most 4k + 3, and there is a bag in the decomposition 
T' containing this whole neighborhood. 

The pseudocode of FindPartialTD is presented as Algorithm [5j The algorithm proceeds 
very similarly to the subroutine FindTD, given in Section SI The main differences are the 
following. 
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• We alternate usage of findSSeparator and findUSeparator between the levels of the recur- 
sion to achieve that the resulting tree decomposition is also balanced. A special flag in 
the data structure, whatsep, that can be set to s or u, denotes whether we are currently 
about to use findSSeparator or findUSeparator, respectively. When initializing the data 
structure we set whatsep = s, so we start with finding a balanced S'-separator. 

• When identifying the next components using query findNextPin, we stop when a com- 
ponent of size less than logn is discovered. The remaining components are left without 
being decomposed. 

The new query findUSeparator, provided that we have the data structure with S and vr distin- 
guished, gives a |-balanced separator of U in G[U] of size at most k + 1. That is, it returns a 
subset Y of vertices of U, with cardinality at most A; + 1, such that every connected component 
of G[U] \ Y has at most ||C/| vertices. If such a separator cannot be found (which is signalized 
by _L), we may safely conclude that tw{G[U]) > k and, consequently tw(G) > k. The running 
time of query findUSeparator is 0{c^ ■ logn). 

We would like to remark that the usage of balanced [/-separators make it not necessary to 
add the pin to the obtained separator. Recall that this was a technical trick that was used in 
Section U] to ensure that the total number of bags of the decomposition was linear. 

5.3.1 Correctness 

The invariants of Algorithm [5] are as for Algorithm [3l except for the size of S, in which case we 
distinguish whether whatsep is s or n. In the case of s the size of S is at most 4A: + 3 and for 
u the size of S is at most 3k + 2. 

If whatsep = u then, since \S\ < 3k + 2 and we add an [/-separator of size at most k + 1 and 
make this our new S, the size of the new S will be at most 4k + 3 and we set whatsep = s. For 
every component C on which we recurse, the cardinality of its neighborhood {S at the moment 
of recursing) is therefore bounded by 4A; + 3. So the invariant holds when whatsep = u. 

We now show that the invariant holds when whatsep = s. Now |olds| < 4A; + 3. We find ^- 
balanced 5-separator sep of size at most k + 1. When recursing, the new S is the neighborhood 
of some component C of G[olds U [/] \ (olds U sep) (note that we refer to U before resetting 
the pin). This component is contained in some component C of G[olds U [/] \ sep, and all the 
vertices of oldg adjacent to C must be contained in C. Since sep is a 2-balanced oldg-separator, 
we know that C contains at most 2 |olds| vertices of olds. Hence, when recursing we have that 
\S\ < ^|olds| + |sep| = |(4A; + 3) + /c + l = 3A;+| and, since IS*! is an integer, it follows that 
\S\ <3k + 2. Hence, the invariant also hold when whatsep = s. 

Note that in both the checks we did not assume anything about the size of the component 
under consideration. Therefore, it also holds for components on which we do not recurse, i.e., 
those of size at most logn, that the cardinalities of their neighborhoods will be bounded by 
4k + 3. 

The fact that the constructed partial tree decomposition is a valid tree decomposition of 
the subgraph induced by vertices contained in it, follows immediately from the construction, 
similarly as in Section 21 A simple inductive argument also shows that the width of this tree 
decomposition is at most 5k + 4: at each step of the construction, we add two bags of sizes at 
most {4k + 3) + (/c + 1) < 5A; + 5 to the obtained decompositions of the components, which by 
inductive hypothesis are of width at most 5k + 4. 

Finally, we show that every connected component of G[S U [/] \ V{G') has size at most 
logn and that the neighborhood of each of these connected component is contained in some 
bag on the partial tree decomposition T' ■ First, by simply breaking out of the loop shown in 
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Data: Data structure DS 

Output: Partial tree decomposition of width at most k of G[S U U] with S as root bag 
or conclusion that tw(G) > k. 

olds ^ VS. get si) 
old^^P5.get^() 
old„ -^ PiS.whatsep 
if PiS.whatsep = s then 

sep -^ D5.findSSeparator() 

"DiS.whatsep -^ u 
else 

sep -^ P5.findUSeparator() 

PiS. what sep -^ s 
end 
if sep = ± then 

DcS.whatsep -^ old„ 

return _L /* safe to return: the state not changed */ 

end 

DiS.insertx (sep) 
pins -^ 
while {u, I) ^ I?5.findNextPin() / ± and / > logn do 

pins. append{u) 

1)5. insert ir(u) 
end 

P5.clearx() 
P5. clear i?() 
I?5.insert5(sep) 
bags ^ 
for u G pins do 

'D5.set7r(ti) 

bags.appen(i(P5.findNeighborhood()) 
end 

children -^ 
for u,b £ pins, bags do 

2?5.set7r(ti) 

P5. clear sO 

P5.insert5'(5) 

children.apj»en(i(FindPartialTD()) 
end 

PiS.whatsep -^ old„ 
P5.clear5() 
PiS. inserts (olds) 
'D5.set,r(old7r) 
if _L G children then 

return _L /* postponed because of rollback of S and tt */ 

end 

return build(olds, sep, children) 

Algorithm 5: FindPartialTD 
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Algorithm [S] at the point we get a pair (tt, Z) such that / < logn, we are guaranteed that the 
connected component of G[S DU]\ sep containing vr has size less than logn, and so does every 
other connected component of G[SL)U] not containing a vertex from F and which has not been 
visited by 2?5.findNextPin(). Furthermore, since immediately before we break out of the loop 
due to small size we add 5 U sep to a bag, we have ensured that the neighborhood of any such 
small component is contained in this bag. The bound on the size of this neighborhood has been 
already argued. 

5.3.2 Complexity 

Finally, we show that the running time of the algorithm is 0{c^ ■ n). The data structure 
operations all take time 0{c'^ logn) and we get the data structure VS as input. 

The following combinatorial lemma will be helpful to bound the number of bags in the 
tree decomposition produced by FindPartialTD. We aim to show that the tree decomposition 
T' contains at most 0(n/logn) bags, so we will use the lemma with /i(i) = u;j/logn, where 
i is a node in a tree decomposition T' and Wi is the number of vertices in G[U] when i is 
added to T'- Having proven the lemma, we can show that the number of bags is bounded by 
0{fi{r)) = 0(n/logn), where r is the root node of T' ■ 

Lemma 5.3. Let T be a rooted tree with root r. Asswne that we are given a measure fi : 
V{T) —7- R with the following properties: 

(i) fi{y) > 1 for every v G V{T), 
(ii) for every vertex v, let fi,f2, ■ ■ ■ ,Vp be its children, we have that J2^=if^i'^i) — /"(^); '^"-^ 

(Hi) there exists a constant < C < 1 such that for for every two vertices v, v' such that v is 
a parent of v' , it holds that ^{v') < C ■ ^{v). 



Then\V{T)\< (l + ^) ;,(r) - 1. 



Proof. We prove the claim by induction with respect to the size of V{T). If |V^(T)| = 1, the 
claim trivially follows from property (i). We proceed to the induction step. 

Let vi,V2,---,Vphe the children of r and let Ti, T2, . . . , Tp be subtrees rooted in t;i, ^2, • • • , t'p, 
respectively. If we apply the induction hypothesis to trees Ti, . . . ,Tp, we infer that for each 

i = 1, 2, ... ,p we have that |F(rj)| < ( 1 + j^ j /u(t'j) — 1. By summing the inequalities we 
infer that: 



\y{T)\ < ^-p+{^ + Y^)jz^'^'^i) 



We now consider two cases. Assume first that p > 2; then: 

\V{T)\ < l-2+(l + ^)f]/.K)<(l + ^)M0-l, 

and we are done. Assume now that p = 1; then 

\V{T)\ < (l + ^)^(^;l)<c(l + ^)MO 

1 + Y^) '^(^) - (2 - C)fi{r) < (1 + r^^ Kr) " 1, 
and we are done as well. D 
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We now prove the following claim. 

Claim 5.4. The partial tree decomposition T' contains at most 42n/logn nodes. 

Proof. Let us partition the set of nodes V{T') into two subsets. At each recursive call of 
FindPartialTD, we create two nodes: one associated with the bag olds, and one associated 
with the bag olds U sep. Let /small be the set of nodes associated with bags olds, and let /large 
the the set of remaining bags, associated with bags olds U sep. As bags are always constructed 
in pairs, it follows that |/smaii| = |/iarge| = ^l^l"/"')!- Therefore, it remains to establish a bound 

on I /small I- 

For a node i G V{T'), let Wi be the number of vertices strictly below i in the tree de- 
composition T', also counting the vertices outside the tree decomposition. Note that by the 
construction it immediately follows that Wi > logn for each i G /small- 

We now partition /small into three parts: I^^^^, /g^'maii' ^^^ ^smln ■ ^!maii consists of all the 
nodes created in recursive calls where what sep = s. /^°ii consists of all the nodes created in 
recursive calls where what sep = u, and moreover the algorithm did not make any more recursive 
calls to FindTD (in other words, all the components turned out to be of size smaller than logn). 
-^smaU consists of all the remaining nodes created in recursive calls where whatsep = u, that is, 
such that the algorithm made at least one more call to FindTD. We aim at bounding the size 
of each of the sets I^^^^^, /"^^[i , and I^^l^^ separately. 

We first claim that |/"i^aul — n/logn. Indeed, we have that the sets of vertices strictly 
below nodes of /"j^^u ^^^ pairwise disjoint. And since any bag in /g^'j^n is a subset of its parent 
and the recursive call to create the bag was made we know that there is at least logn vertices 
below. As their total union is of size at most n, the claim follows. 

We now claim that |/^!^iil — l-^s^maiil- Indeed, if with every node i S /"^^u ^^ associate any 
of its grandchild belonging to 1^^^^^^, whose existence is guaranteed by the definition of /^!^ii' 
we obtain an injective map from /^^u into /|jnair 

We are left with bounding l/smaiil- -^°^ this, we make use of Lemma [5.3[ Recall that vertices 
of /gj^a^ii are exactly those that are in levels whose indices are congruent to 1 modulo 4, where 
the root has level 1; in particular, r S I^y^nw We define a rooted tree T as follows. The vertex 
set of T is /g^maii' ^^"^ ^°^ every two nodes i,i' G /s^maii such that i' is an ancestor of i exactly 
4 levels above (grand-grand-grand-parent), we create an edge between i and i' . It is easy to 
observe that T created in this manner is a rooted tree, with r as the root. 

We can now construct a measure ^ : V{T) — )• M by taking fi{i) = Wi/logn. Let us check 
that fi satisfies the assumptions of Lemma 15.31 for C = |. Property (i) follows from the fact 
that Wi > logn for every i S /small- Property (ii) follows from the fact that the parts of 
the components on which the algorithm recurses below the bags are always pairwise disjoint. 
Property (iii) follows from the fact that between every pair of parent, child in the tree T 
we have used a |-balanced f/-separator. Application of Lemma 15.31 immediately gives that 
l-^smaiil — lOn/logn, and hence |^(T')| < 42n/logn. D 

To conclude the running time analysis of FindPartialTD, we provide a similar charging 
scheme as in Section [H More precisely, we charge every node of T' with 0{c ■ logn) running 
time; Claim [57^ ensures us that then the total running time of the algorithm is then 0{c'^ • n). 

Let B = olds and B' = oldsUsep be the two bags constructed at some call of FindPartialTD. 
All the operations in this call, apart from the two loops over the components, take 0{c^ ■ logn) 
time and are charged to B' . Moreover, the last call of findNextPin, when a component of size 
smaller than log n is discovered, is also charged to B' . As this call takes 0(1) time, B' is charged 
with Olc'^ • logn) time in total. 
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We now move to examining the time spent while iterating through the loops. Let Bj be the 
root bag of the decomposition created for graph Gj . We charge Bj with all the operations that 
were done when processing Gj within the loops. Note that thus every such Bj is charged at 
most once, and with running time 0{c ■ logn). Summarizing, every bag of T' is charged with 
0{c^ ■ logn) running time, and we have at most 42n/logn bags, so the total running time of 
FindPartialTD is 0{c^ ■ n). 

6 An 0{c^n) 5-approxiniation algorithm for treewidth 

In this section we give the main result of the paper. The algorithm either calls Alg^, for a = 2 or 
a version of Bodlaender [9j applying a table lookup implementation of the dynamic programming 
algorithm by Bodlaender and Kloks [T5], depending on how n and k relate. These techniques 
combined will give us a 5-approximation algorithm for treewidth in time single exponential in 
k and linear in n. 

Theorem VI (Theorem [U restated). There exists an algorithm, that given an n-vertex graph 
G and an integer k, in time 2^ 'n either outputs that the treewidth of G is larger than k, or 
constructs a tree decomposition of G of width at most 5k + 4. 

As mentioned above, our algorithm distinguishes between two cases. The first case is when 
n is "sufficiently small" compared to k. By this, we mean that n < 2'^ ° . The other case is 
when this is not the case. For the first case, we can apply Algg and since n is sufficiently small 
compared to k we can observe that log^ ' n = k^^^\ resulting in a 2^^^'n time algorithm. For 
the case when n is large compared to k, we construct a tree automata in time double exponential 
in k. In this case, double exponential in k is in fact also linear in n. This automaton is then 
applied on a nice expression tree constructed from our tree decomposition and this results in 
an algorithm running in time 2^^^>n. 

Lemma 6.1 (Bodlaender and Kloks [15j). There is an algorithm, that given a graph G, an 
integer k, and a nice tree decomposition of G of width at most i with 0(n) bags, either decides 
that the treewidth of G is more than k, or finds a tree decomposition of G of width at most k in 
time 0(20(fc^')n). 

Our implementation with table lookup of this result gives the following: 

Lemma 6.2. There is an algorithm, that given a graph G, an integer k, and a nice tree decom- 
position of G of width at most I = 0{k) with 0[n) hags, either decides that the treewidth of G 

is more than k, or finds a tree decomposition of G of width at most k, in time 0(2 -j-k'^^n), 

for constants cq and ci . 

The proof of Lemma [6.21 will be given later. We first discuss how Lemma [6.21 combined with 
the results in other sections imply the main result of the paper. First we handle the case when 

n < 2^ ° . We then call Algg on each connected component of G separately, with S" = 0. This 
algorithm runs in time 0{c'^ ■ n log logn) = 0{c'^nk^) = 0{c^n) time for some constants c and 



c'. 
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For the remaining of this section we will assume n > 2 . An inspection of Bodlaender's 
algorithm [U] shows that it contains the following parts: 

• A recursive call to the algorithm is made on a graph with c^n vertices, for C3 = 1 — 

1 

• The algorithm of Lemma [6.1 1 is called with ^ = 2A; + 1. 
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• Some additional work that uses time linear in n and polynomial in k. 

The main difference of our algorithm in the case of "large n" is that we replace the call to 
the algorithm of Lemma l6.ll with a call to the algorithm of Lemma 16.21 again with i = 2k + 1. 

At some point in the recursion, instances will have size less than 2^''° . At that point, we call 
Alg2 on this instance; with the main difference that at one level higher in the recursion, the 
algorithm of Lemma 16.21 is called with i = 10k + 9. 

The analysis of the running time is now simple. A call to the algorithm of Lemma 16.2 

OCA k£ 

uses time which is bounded by 0(2 + k'^^n) = 0{n -\- k'^^n), i.e., time linear in n and 

polynomial in k. The total work of the algorithm is thus bounded by a function T(n) that 
fulfills T{n) < T{c3n) + 0{k'^'^n) = 0{k^~^'^'^n), for constant C4, i.e., time polynomial in k and 
linear in n. This proves our main result. What remains for this section is a proof of Lemma 



6.1 Nice expression trees 

The dynamic programming algorithm in |15j is described with help of so called nice tree decom- 
positions. As we need to represented a nice tree decomposition as a labeled tree with the label 
alphabet of size a function of k, we use a slightly changed notion of labeled nice tree decompo- 
sition. The formalism is quite similar to existing formalisms, e.g., the operations on fc-terminal 
graphs by Borie [TU]. 

A labeled terminal graph is a 4-tuple G = {V, E, X, /), with {V, E) a graph, X — )• 1/ a set of 
terminals, and / : X — )• AA an injective mapping of the terminals to non-negative integers, which 
we call labels. A fe-labeled terminal graph is a labeled terminal graph with the maximum label 
at most k, i.e., max^;^^ f{x) < k. Let Ok be the set of the following operations on A;-terminal 
graphs. 

LeaffO: gives a /c-terminal graph with one vertex v, no edges, with v a terminal with label (. 

Introduce£5(G): G = (y,E,X,f) is a /c-terminal graph, i a non-negative integer, and S C 
{1, . . . ,k} a set of labels. If there is a terminal vertex in G with label i, then the operation 
returns G, otherwise it returns the graph, obtained by adding a new vertex v, making v a 
terminal with label i, and adding edges {v, w} for each terminal w £ X with /(if) G S. I.e., we 
make the new vertex adjacent to each existing terminal whose label is in S. 

Forget^(G): Again G = (y,E,X,f) is a A;-terminal graph. If there is no vertex v G X with 
f{v) = i, then the operation returns G, otherwise, we 'turn v into a non-terminal, i.e., we return 
the fe-terminal graph (V, E,X — {v}, /') for the vertex v with f(v) = i, and /' is the restriction 
of / to X - {v}. 

Join{G,H): G = {V,E,X,f) and H = {W,F,Y,g) are fc-terminal graphs. If the range of / 
and g are not equal, then the operation returns G. Otherwise, the result is obtained by taking 
the disjoint union of the two graphs, and then identifying terminals with the same label. 

Note that for given k, O^ is a collection oi k -\- k ■ 2^^ -\- k -\- k"^ operations. When the treewidth 
is k, we work with k -\- 1-terminal graphs. The set of operations mimics closely the well known 
notion of nice tree decompositions (see e.g., [301 [TO] ). 

Proposition 6.3. Suppose a tree decomposition of G is given of width at most k with m bags. 
Then, in time, linear in n and polynomial in k, we can construct an expression giving a graph 
isomorphic to G in terms of operations from O^+i with the length of the expression 0{mk). 

Proof. First build with standard methods a nice tree decomposition of G of width k; this has 
0{km) bags, and 0{m) join nodes. Now, construct the graph H = {V, F), with for all v,w gV , 
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{v, w} £ F, if and only if there is a bag i with v,w €z Xi. It is well known that H is a chordal 
super graph of G with maximum clique size k + 1 (see e.g., [lO])- Use a greedy linear time 
algorithm to find an optimal vertex coloring c oi H (see |28[ Section 4.7].) 

Now, we can transform the nice tree decomposition to the expression as follows: each leaf 
bag that contains a vertex v is replaced by the operation Leafc(t,), i.e., we label the vertex by 
its color in H. We can now replace bottom up each bag in the nice tree decomposition by the 
corresponding operation; as we labeled vertices with the color in H, we have that all vertices in 
a bag have different colors, which ensures that a Join indeed performs identifications of vertices 
correctly. Bag sizes are bounded by /c + 1, so all operations belong to Ofc+i- D 

View the expression as a labeled rooted tree, i.e., each node is labeled with an operation 
from Ojt+i; leaves are labeled with a Leaf operation, and binary nodes have a Join-label. To 
each node of the tree i, we can associate a graph G,, and the graph Gr associated to the root 
node r is isomorphic to G. Call such a labeled rooted tree a nice expression tree of width k. 

6.2 Dynamic programming and finite state tree automata 

The discussion in this paragraph holds for all problems invariant under isomorphism,. Note 
that the treewidth of a graph is also invariant under isomorphisms. We use ideas from the early 
days of treewidth, see e.g., p6l[2]. 

A dynamic programming algorithm on nice tree decompositions can be viewed also as a 
dynamic programming algorithm on a nice expression tree of width k. Suppose that we have 
a dynamic programming algorithm that computes in bottom-up order for each node of the 
expression tree a table with at most r = 0(1) bits per table, and to compute a table, only 
the label of the node (type of operation) and the tables of the children of the node are used. 
We remark that the DP algorithm for treewidth from Bodlaender and Kloks [15] is indeed of 
this form, if we see /c as a fixed constant. Such an algorithm can be seen as a finite state tree 
automaton: the states of the automaton correspond to the at most 2'" = 0(1) different tables; 
the alphabet are the 0(1) different labels of tree nodes. 

To decide if the treewidth of G is at most /c, we first explicitly build this finite state tree 
automaton, and then execute it on the expression tree. For actually building the corresponding 
tree decomposition of G of width at most k, if existing, some more work has to be done, which 
is described later. 

6.3 Table lookup implementation of dynamic programming 

The algorithm of Bodlaender and Kloks [15] builds for each node in the nice tree decomposition 
of table of characteristics: each characteristic represents the 'essential information' of a tree 
decomposition of width at most k of the graph associated with the bag. 

Inspection of the algorithm [TJ] easily shows that the number of different characteristics is 
bounded by 2 ' ' when we are given an expression tree of width l and want to test if the 
treewidth is at most k. (See [151 Definition 5.9].) 

We now use that we represent the vertices in the bag, i.e., the terminals, by a label from 
{1, ...,£ + 1} if £ is the width of the nice expression tree. Thus, we have a set Ck/ (only 
depending of k and t) that contains all possible characteristics that belong to a table; each 
table is just a subset of G^^i, i.e., an element of V{Gk/)- I.e., we can view the decision variant of 
the dynamic programming algorithm of Bodlaender and Kloks as a finite state tree automaton 
with alphabet O^+i and state set V{Gk.e)- 

The first step of the algorithm is now to explicitly construct this finite state tree automaton. 
We can do this as follows. Enumerate all characteristics in V{Ck/), and number them ci, . . . , c^, 
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s = i^v^-f- ). Enumerate all elements ofV{Ck/), and number them ti, . . . , t^/, s' = 2^ ; store 

with ti the elements of its set. 

Then, we compute a transition function F : O^+i x {1, . . . , s'} x {1, . . . , s'} — )• {1, . . . , s'}. In 
terms of the finite state automaton view, F gives the state of a node given its symbol and the 
states of its children. (If a node has less than two children, the third, and possibly the second 
argument are ignored.) In terms of the DP algorithm, if we have a tree node i with operation 
o S O^+i, and the children of i have tables corresponding to ta and tp, then F(o, a, /3) gives 
the number of the table obtained for i by the algorithm. To compute one value of F, we just 
execute one part of the algorithm of Bodlaender and Kloks [15]. Suppose we want to compute 
F(o, a, /?). If o is a shift operation, then a simple renaming suffices. Otherwise, build the tables 
Ta and Tp corresponding to ta and tp, and execute the step of the algorithms from [15] for a 
node with operation o whose children have tables Tq and T^. (If the node is not binary, we 
ignore the second and possibly both tables.) Then, lookup what is the index of the resulting 
table; this is the value of F{o, a, /?). 

We now estimate the time to compute F. We need to compute 0(2^ • i ■ s'"^) = 0(2^ ) 

values; each executes one step of the DP algorithm and does a lookup in the table, which is 

easily seen to be bounded again by 0(2 ), so the total time to compute F is still bounded 

by 0(2^ ). To decide if the treewidth of G is at most k, given a nice tree decomposition of 

width at most £, we thus carry out the following steps: 

• Compute F. 

• Transform the nice tree decomposition to a nice expression tree of width i. 

• Compute bottom- up (e.g., in post-order) for each node i in the expression tree a value Vi, 
with a node i labeled by operation o G O^+i and children with values Vj^, Vj^, we have 
Vi = F{o,Vj^,Vj^). If V has less than two children, we take some arbitrary argument for 
the values of missing children. In this way, Vi corresponds to the table that is computed 
by the DP algorithm of [15] . 



• If the value Vr for the root of the expression tree corresponds to the empty set, then the 
treewidth of G is more than /c, otherwise the treewidth of G is at most k. (See [151 Section 
4.6 and 5.6].) 

If our decision algorithm decides that the treewidth of G is more than /c, we reject, and we 
are done. Otherwise, we need to do additional work to construct a tree decomposition of G of 
width at most /c, which is described next. 

6.4 Constructing tree decompositions 

After the decision algorithm has determined that the treewidth of G is at most k, we need to 
find a tree decomposition of G of width at most k. Again, the discussion is necessarily not self 
contained and we refer to details given in ^5, Chapter 6]. 

Basically, each table entry (characteristic) in the table of a join node is the result of a 
combination of a table entry in the table of the left child and a table entry of the table of the 
right child. Similarly, for nodes with one child, each table entry is the result of an operation to 
a table entry in the table of the child node. Leaf nodes represent a graph with one vertex, and 
we have just one tree decomposition of this graph, and thus one table entry in the table of a 
leaf node. 

If we select a characteristic of the root bag, this recursively defines one characteristic from 
each table. In the first phase of the construction, we make such a selection. In order to do 

37 



this, we first pre-compute another function g, that helps to make this selection, g has four 
arguments: an operation from O^+i, the index of a characteristic (a number between 1 and 
s), and the indexes of two states (numbers between 1 and s' = 2*). As value, g yields _L or 
a pair of two indexes of characteristics. The intuition is as follows: suppose we have a node i 
in the nice expression tree labeled with o, an index Cj of a characteristic of a (not yet known) 
tree decomposition of Gi, and indexes of the tables of the children of i, say tj^ and tj2- Now, 
g{o,Ci,tjj^,tj2) should give a pair (cj-^jCj^) such that q is the result of the combination of Cj^ 
and Cj2 (in case that o is the join operation) or of the operation as indicated above to Cj-^ (in 
case o is an operation with one argument; Cj^ can have any value and is ignored). If no such 
pair exists, the output is _L. 

To compute g, we can perform the following steps for each 4-tuple o, q , tj-^ , tj^ ■ Let 5*1 C Ck/ 
be the set corresponding to tj-^, and ^2 C C^^e be the set corresponding to tj^. For each c £ Si 
and c' G 5*2, see if a characteristic c and a characteristic c' can be combined (or, in case of a 
unary operation, if the relevant operation can be applied to c) to obtain ci. If we found a pair, 
we return it; if no combination gives ci, we return _L. Again, in case of unary operations o, we 
ignore c'. We do not need g in case o is a leaf operation, and can give any return values in such 

cases. One can easily see that the computation of g uses again 2 time. 

The first step of our construction phase is to build g, as described. After this, we select 
a characteristic from Ck/ for each node in the nice expression tree, as follows. As we arrived 
in this phase, the state of the root bag corresponds to a nonempty set of characteristics, and 
we take an arbitrary characteristic from this set (e.g., the first one from the list). Now, we 
select top-down in the expression tree (e.g., in pre-order) a characteristic. Leaf nodes always 
receive the characteristic of the trivial tree decomposition of a graph with one vertex. In all 
other cases, if node i has operation o and has selected characteristic c, the left child of i has 
state ij^ and the right child of i has state tj^ (or, take any number, e.g., 1, if i has only one 
child, i.e., o is a unary operation), look up the precomputed value of 5(0, c, ij+i, t^j). As c is a 
characteristic in the table that is the result of F(o, tj+i, tjj)) 5 7^ -L) so suppose g is the pair 
{c',c"). We associate c' as characteristic with the left child of i, and (if i has two children) c" 
as characteristic with the right child of i. 

At this point, we have associated a characteristic with each node in the nice expression 
tree. These characteristics are precisely the same as the characteristics that are computed in 
the constructive phase of the algorithm from [15i Section 6], with the sole difference that we 
work with labeled terminals instead of the 'names' of the vertices (i.e., in [15j . terminals / bag 
elements are identified as elements from V). 

From this point on, we can follow without significant changes the algorithm from [15^ Section 
6]: bottom-up in the expression tree, we build for each node i, a tree decomposition of Gi whose 
characteristic is the characteristic we just selected for i, together with a number of pointers 
from the characteristic to the tree decomposition. Again, the technical details can be found 
in [15], our only change is that we work with terminals labeled with integers in {1, . . . ,1 + 1} 
instead of bag vertices. 

At the end of this process, we obtain a tree decomposition of the graph associated with the 
root bag Gr = G whose characteristic belongs to the set corresponding to the state of r. As we 
only work with characteristics of tree decompositions of width at most k, we obtained a tree 
decomposition of G of width at most k. 

All work we do, except for the pre-computation of the tables of F and g is linear in n and 
polynomial in k; the time for the pre-computation does not depend on n, and is bounded by 

2 . This ends the description of the proof of Lemma 16.21 



38 



7 A data structure for queries in 0(c^ log n) time 

7.1 Overview of the data structure 

Assume we are given a tree decomposition {{Bi \ i G I},T = {I,F)) of G of width 0{k). 
First we turn our tree decomposition into a tree decomposition of depth O(logn), keeping the 
width to t = 0{k), by the work of Bodlaender and Hagerup [H]. Furthermore, by standard 
arguments we turn this decomposition into a nice tree decomposition in 0(t '^^ • n) time, that 
is, a decomposition of the same width and satisfying following properties: 

• All the leaf bags, as well as the root bag, are empty. 

• Every node of the tree decomposition is of one of four different types: 

— Leaf node: a node i with Bi = 9 and no children. 

— Introduce node: a node i with exactly one child j such that Bi = Bj U {v} for 
some vertex v ^ Bj; we say that v is introduced in i. 

— Forget node: a node i with exactly one child j such that Bi = Bj \ {v} for some 
vertex v G Bi; we say that v is forgotten in i. 

— Join node: a node i with two children ji,J2 such that Bi = Bj-^ = Bj^. 

The standard technique of turning a tree decomposition into a nice one includes (i) adding 
paths to the leaves of the decomposition on which we consecutively introduce the vertices of 
corresponding bags; (ii) adding a path to the root on which we consecutively forget the vertices 
up to the new root, which is empty; (iii) introducing paths between every non-root node and its 
parent, on which we first forget all the vertices that need to be forgotten, and then introduce 
all the vertices that need to be introduced; (iv) substituting every node with d > 2 children 
with a balanced binary tree of O(logd) depth. It is easy to check that after performing these 
operations, the tree decomposition has depth at most O(tlogn) and contains at most 0{t ■ n) 
bags. Moreover, using folklore preprocessing routines, in 0{t^^^' • n) time we may prepare 
the decomposition for algorithmic uses, e.g., for each bag compute and store the list of edges 
contained in this bag. We omit here the details of this transformation and refer to Kloks |SOJ. 

In the data structure, we store a number of tables: three special tables that encode general 
information on the current state of the graph, and one table per each query. The information 
stored in the tables reflect some choice of subsets of V , which we will call the current state of 
the graph. More precisely, at each moment the following subsets will be distinguished: S,X,F 
and a single vertex vr, called the pin. The meaning of these sets is described in Section HI On 
the data structure we can perform the following updates: adding/removing vertices to S, X, F 
and marking/unmarking a vertex as a pin. In the following table we gather the tables used by 
the algorithm, together with an overview of the running times of updates. The meaning of the 
table entries uses terminology that is described in the following sections. 

The following lemma follows from each of the entries in the table below, and will be proved 
in this section: 

Lemma 7.1. The data structure can be initialized in 0{c n) time. 
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Table 


Meaning 


Update 


Initialization 


Pi 


Boolean value vr G Wj 


O(t-logn) 


0{t ■ n) 


C[i][iS^,U,)] 


Connectivity information on C/f^* 


0(3*-t^W-logn) 


0(3* • t^W . n) 


CardU[i][{Si,Ui)] 


Integer value jt/f^* n Wj] 


0(3*-t^W-logn) 


0(3* • t^(i) • n) 


TlmS^,U,)] 


Table for query findNeighborhood 


0(3*-fc^W-logn) 


0(3* • fc^(i) • n) 


T2mS^,U^)M 


Table for query findS Separator 


0(9*-A;^W-logn) 


0(9* • A;^(i) • n) 


TsmS„U^,X,,F,)] 


Table for query findNextPin 


0(6*-t^W-logn) 


0(6* • t^(i) • n) 


TM{S^,U^)M 


Table for query findUSeparator 


0(5*-A;^W-logn) 


0(5* • A;^(i) • n) 



We now proceed to description of the description of the table P, and then to the two tables 
C and CardU that handle the important component U. The tables Ti,T2,T2, are described 
together with the description of realization of the corresponding queries. Whenever describing 
the table, we argue how the table is updated during updates of the data structure, and initialized 
in the beginning. 

7.2 The table P 

In the table P, for every node i of the tree decomposition we store a boolean value P[i] equal to 
(tt G Wi). We now show how to maintain the table P when the data structure is updated. The 
table P needs to be updated whenever the pin vr is marked or unmarked. Observe, that the 
only nodes i for which the information whether tt G Wi changed, are the ones on the path from 
Ttt to the root of the tree decomposition. Hence, we can simply follow this path and update 
the values. As the tree decomposition has depth O(tlogn), this update can be performed in 
0{t ■ log n) time. As when the data structure is initialized, no pin is assigned, P is initially filled 
with _L. 

7.3 Maintaining the important component U 

Before we proceed to the description of the queries, let us describe what is the reason of intro- 
ducing the pin vr. During the computation, the algorithm recursively considers smaller parts of 
the graph, separated from the rest via a small separator: at each step we have distinguished set 
S and we consider only one connected component U of G\S. Unfortunately, we cannot afford 
recomputing the tree decomposition of U at each recurrence call, or even listing the vertices of 
U. Therefore we employ a different strategy for identification of U. We will distinguish one 
vertex of [/ as a representative pin tt, and U can then be defined as the set of vertices reachable 
from TT in G \ S*. Instead of recomputing U at each recursive call we will simply change the pin. 

In the tables, for each node i of the tree decomposition we store entries for each possible 
intersection of U with Bi, and in this manner we are prepared for every possible interaction of 
U with Gj. In this manner, changing the pin can be done more efficiently. Information needs to 
be recomputed on two paths to the root in the tree decomposition, corresponding the previous 
and the next pin, while for subtrees unaffected by the change we do not need to recompute 
anything as the tables stored there already contain information about the new U as well — as 
they contain information for every possible new U. As the tree decomposition is of logarithmic 
depth, the update time is logarithmic instead of linear. 

We proceed to the formal description. We store the information about U in two special 
tables: C and CardU. As we intuitively explained, tables C and CardU store information 
on the connectivity behavior in the subtree, for every possible interaction of U with the bag. 
Formally, for every node of the tree decomposition i we store an entry for every member of the 
family of signatures of the bag Bi. A signature of the bag Bi is a pair (Si, Ui), such that Si, Ui 
are disjoint subsets of Bi. Clearly, the number of signatures is at most 3'^"-'. 
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Let i be a node of the tree decomposition. For a signature (j) = {Si, Ui) of Bi, let 5|^* = 
5i U (5n Wi) and Uf^^ consists of all the vertices reachable in Gi \ Sf^^ from Ui or vr, providing 
that it belongs to Wi. Sets S'P* and Uf^^ are called extensions of the signature (j); note that 
given Si and C/j, the extensions are defined uniquely. We remark here that the definition of 
extensions depend not only on (p but also on the node i; hence, we will talk about extensions of 
signatures only when the associated node is clear from the context. 

We say that signature (j) of Bi with extensions Sf^^ and U'?^'^ is valid if it holds that 

(i) [/-t nBi = Ui, 

(ii) if fj 7^ and vr G Wi (equivalently, P[i] is true), then the component of G[L/jP^''] that 
contains tt contains also at least one vertex of Ui. 

Intuitively, invalidity means that </> cannot contain consistent information about intersection of 
U and Gi. The second condition says that we cannot fully forget the component of vr, unless 
the whole C/f^* is already forgotten. 

Formally, the following invariant explains what is stored in tables G and GardU: 

• if (^ is invalid then (7[i][0] = GardU[i][(l)] = _L; 

• otherwise, C[i][0] contains an equivalence relation R consisting of all pairs of vertices 
(a, 6) e Ui that are connected in Gj[[/f''*], while GardU[i][(l)\ contains jf/f"^* r\Wi\. 

Note that in this definition we actually ignore the information about alignment of vertices 
of Bi to set S, F, X in the current state of the graph: the stored information depends only 
on the alignment of forgotten vertices and the signature of the bag that overrides the actual 
information in the current state. In this manner we are prepared for possible changes in the 
data structure, as after an update some other signature will reflect the current state of the 
graph. Moreover, it is clear from this definition that during the computation, the alignment of 
every vertex v in the current state of the graph is being checked only in the single node r^ when 
this vertex is being forgotten; we use this property heavily to implement the updates efficiently 
enough. 

We now explain how for every node i, values in tables G[i] and GardU[i] can be computed 
using entries for the children of i. These formulas will be crucial both for implementing updates 
and initialization. We consider different cases, depending on the type of node i. 

Case 1: Leaf node. If i is a leaf node then C[i][(0,0)] = and CardC/[i][(0,0)] = 0. 

Case 2: Introduce node. Let i be a node that introduces vertex v, and j be its only child. 
Consider some signature (p = {Si, Ui) of Bi; we would like to compute Ri = G[i][(j)]. Let (p' be a 
natural projection of (p onto Bj, that is, cp' = {Si D Bj, Ui D Bj). Let Rj = G[j] [(p']. We consider 
some sub-cases, depending on the alignment of v in (p. 

Case 2.1: v £ Si. If we introduce a vertex from Si, then it follows that extensions of 
Ui = Uj are equal. Therefore, we can put C[i][0] = C[j][0'] and GardU[i][(p] = GardU[j][p']. 

Case 2.2: v £ Ui. In the beginning we check whether conditions of validity are not violated. 
First, if V is the only vertex of Ui and P[i] = T, then we simply put C [?][(/>] = _L: condition (ii) 
of validity is violated. Second, we check whether v is adjacent only to vertices of Sj and Uj; if 
this is not the case, we put C[i][(/>] = _L as condition (i) of validity is violated. 

If the validity checks are satisfied, we can infer that the extension Uf^^ of Ui is extension 
C/?^* of Uj with V added; this follows from the fact that Bj separates v from Wj, so the only 
vertices of Uf^^ adjacent to v are already belonging to Uj. Now we would like to compute 
the equivalence relation Ri out of Rj. Observe that Ri should be basically Rj augmented by 
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connections introduced by the new vertex v between its neighbors in Bj. Formally, Ri may be 
obtained from Rj by merging equivalence classes of all the neighbors of v from Uj, and adding 

V to the obtained equivalence class; if v does not have any neighbors in Uj, we put it as a new 
singleton equivalence class. Clearly, CardU[i][(j)] = CardU\j][4>']. 

Case 2.3: v & Bi \ (Si U Ui). We first check whether the validity constraints are not 
violated. As v is separated from Wj by Bj, the only possible violation introduced by v is that 

V is adjacent to a vertex from Uj. In this situation we put C[i][(^] = Car(iC/[i][0] = _L, and 
otherwise we can put C[i][(^] = C[j][0'] and CardU[i][(p] = CardU[j][<l)'], because extensions of 
(p and (j)' are equal. 

Case 3: Forget node. Let i be a node that forgets vertex w, and j be its only child. 
Consider some signature (j) = {Si, Ui) of Bi and define extensions Sf^^, Uf^^ for this signature. 
Observe that there is at most one valid signature </>' = {Sj, Uj) of Bj for which S*^^* = Sf^^ and 
Uj^^ = Uf^^ , and this signature is simply (p with w added possibly to Si or Ui, depending whether 
it belongs to Sf^'^ or ^f^*: the three candidates are (j)s = {Si U {w},Ui), cpu = {Si,Ui U {w}) 
and (pQ = {Si, Ui). Moreover, if (p is valid then so is (p' . Formally, in the following manner we 
can define signature (p' , or conclude that (j) is invalid: 

• if w £ S, then (j)' = (ps; 

• otherwise, if w = n then (p' = (pu; 

• otherwise, we look into entries C[j] [(/>[/] and C[j][(/)o]- If 

(i) C[j][0;7] = C[j][0o] = -L then (j) is invalid, and we put C[i][0] = CardU[i\\<p\ = _L; 

(ii) if C[j][0[/] = _L or C[j][(^o] = -L, we take </>' = 4>q or </>' = (pjj, respectively; 

(iii) if C[j] [(pu] / -L and C[j] [(po] / _L, it follows that w must be a member of a component 
of Gi \ Sf^'^ that is fully contained in Wi and does not contain vr. Hence we take 

<P' = <Po- 

The last point is in fact a check whether w G Uf^^: whether w is connected to a vertex from Ui 
in Gi, can be looked up in table G[j] by adding or not adding w to Ui, and checking the stored 
connectivity information. If w G Sf^^ or w G C/?^*, we should be using the information for the 
signature with Si or Ui updated with w, and otherwise we do not need to add w anywhere. 

As we argued before, if (p is valid then so does cp' , hence if C[j] [(/)'] = _L then we can take 
C[i][(/)] = GardU[i][(p] = _L. On the other hand, if cp' is valid, then the only possibility for (j) 
to be invalid is when condition (ii) cease to be satisfied. This could happen only if (j)' = (pjj 
and ty is in a singleton equivalence class of C[j] [</>'] (note that then the connected component 
corresponding to this class needs to necessarily contain vr, as otherwise we would have (j)' = <Po)- 
Therefore, if this is the case, we put C[i][(^] = GardU[i][(p] = ±, and otherwise we conclude that 
(p is valid and move to defining C[i][0] and GardU[i][(p]. 

Let now Rj = G[j][(p']. As extensions of (p' and (p are equal, it follows directly from the 
maintained invariant that Ri is equal to Rj with w removed from its equivalence class. Moreover, 
GardU[i][(pi] is equal to GardU[j][(p'], possibly incremented by 1 if we concluded that <p' = (pfj. 

Case 4: Join node. Let i be a join node and ji,J2 be its two children. Consider some 
signature </> = {Si,Ui) of Bi. Let (pi = {Si,Ui) be a signature of Bj-^ and 4>2 = {Si,Ui) be a 
signature of Bj^. From the maintained invariant it follows that C[i][0] is a minimum transitive 
closure of C[ji][<?^i] U C[j2][02]; or _L if any of these entries contains _L. Similarly, GardU[i][(pi] = 
GardU[h][cPi] + GardU[J2][(P2]. 
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We now explain how to update tables C and CardU in 0(3* • t^^^> • logn) time. We perform 
a similar strategy as with table P: whenever some vertex v is included or removed from S", or 
marked or unmarked as a pin, we follow the path from r^ to the root and fully recompute the 
whole tables C, CardU in the traversed nodes using the formulas presented above. At each step 
we recompute the table for some node using the tables of its children; these tables are up to date 
since they did not need an update at all, or were updated in the previous step. Observe that 
since the alignment of v in the current state of the graph is accessed only in computation for r^, 
the path from r„ to the root of the decomposition consists of all the nodes for which the tables 
should be recomputed. Note also that when marking or unmarking the pin tt, we must first 
update P and then C and CardU. The update takes 0(3* • t^^^' ■ logn) time: re-computation 
of each table takes 0(3* • t^^') time, and we perform O(tlogn) re-computations as the tree 
decomposition has depth O(tlogn). 

Similarly, tables C and CardU can be initialized in 0(3* • t^^' ■ n) time by processing the 
tree in a bottom-up manner: for each node of the tree decomposition, in 0(3* • t^^^') time we 
compute its table based on the tables of the children, which were computed before. 

7.4 Queries 

In our data structure we store one table per each query. When describing every query, we first 
introduce the formal invariant on storage of table's entry, and how this stored information can 
be computed based on the entries for children. We then shortly discuss performing updates 
and initialization of the tables, as they are in all the cases based on the same principle as with 
tables C and CardU . Queries themselves can be performed by reading a single entry of the data 
structure, with the exception of query findUSeparator, whose implementation is more complex. 

7.4.1 Query findNeighborhood 

We begin the description of the queries with the simplest one, namely findNeighborhood. This 
query lists all the vertices of S that are adjacent to U . In the algorithm we have an implicit 
bound on the size of this neighborhood, which we can use to cut the computation when the 
accumulated list grows too long. We use i to denote this bound; in our case we have that 
l = 0{k). 



findNeighborhood 

Output: A list of vertices of N(U) D S, or marker 'h' if their number is larger than 

Time: 0{i) 



Let i be a node of the tree decomposition, let 4> = {Si,Ui) be a signature of Bi, and let 
C/f^*,5f^* be extensions of this signature. In entry Ti[i][(/)] we store the following: 

• if (^ is invalid then Ti['i][0] = _L; 

• otherwise Ti [i] [0] stores the list of elements of N(Uf^^)r] Sf^^ if there is at most i of them, 
and Kl if there is more of them. 

Note that the information whether (p is invalid can be looked up in table C. The return value 
of the query is stored in r[r][(0, 0)]. 

We now present how to compute entries of table Ti for every node i depending on the entries 
of children of i. We consider different cases, depending of the type of node i. For every case, 
we consider only signatures that are valid, as for the invalid ones we just put value _L. 

Case 1: Leaf node. If i is a leaf node then Ti[i][(0,0)] = 0. 
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Case 2: Introduce node. Let i be a node that introduces vertex v, and j be its only child. 
Consider some signature (j) = {Si,Ui) of Bi; we would like to compute Ti[i][(^] = Lj. Let (p' be 
a natural intersection of (j) with Bj, that is, (j)' = {Si D Bj,Ui D Bj). Let Ti[j][(/)'] = Lj. We 
consider some sub-cases, depending on the alignment of v in (j). 

Case 2.1: v £ Si. If we introduce a vertex from Si, we have that [/-extensions of (p and 
<j)' are equal. It follows that Li should be simply list Lj with v appended if it is adjacent to 
any vertex of Uj = Ui. Note here that v cannot be adjacent to any vertex of f/?^* \ C/j, as Bj 
separates v from Wj. Hence, we copy the list Lj and append v if it is adjacent to any vertex of 
Uj and Lj ^ ^. However, if the length of the new list exceeds the i bound, we replace it by Kl. 
Note that copying the list takes 0{i) time, as its length is bounded by i. 

Case 2.2: v £ Ui. If we introduce a vertex from U, then possibly some vertices of Si gain 
a neighbor in f/?^*. Note here that vertices of Sf^^ \Si are not adjacent to the introduced vertex 
V, as Bj separates v from Wj. Hence, we copy list Lj and append to it all the vertices of Si 
that are adjacent to v, but were not yet on Lj. If we exceed the i bound on the length of the 
list, we put Kl instead. Note that both copying the list and checking whether a vertex of Si is 
on it can be done in 0{i) time, as its length is bounded by i. 

Case 2.3: v £ Bi\ {SiD Ui). In this case extensions of (j) and (p' are equal, so it follows 
from the invariant that we may simply put T[z][(/)] = T[j] [(/>']. 

Case 3: Forget node. Let i be a node that forgets vertex w, and j be its only child. Consider 
some signature (j) = iSi,Ui) ol Bi. Define (p' in the same manner as in the Forget step in the 
computation of C. As extensions of (p and cp' are equal, it follows that Ti[z][0] = Ti[j] [(/>']. 

Case 4: Join node. Let i be a join node and ji,J2 be its two children. Consider some 
signature (p = (SijU) of Bi. Let (pi = {Si,Ui) be a signature of Bj-^ and (p2 = {Si,Ui) be a 
signature of Bj^. It follows that Ti [?][(/)] should be the merge of lists Ti[ji][i;Ai] and Ti [J2] [(/)2] > 
where we remove the duplicates. Of course, if any of these entries contains Kl, we simply put lEl. 
Otherwise, the merge can be done in 0{i) time due to the bound on lengths of ri[ji][^i] and 
rib2][i;^2]) and if the length of the result exceeds the bound i, we replace it by lEl. 

Similarly as before, for every addition/removal of vertex v to/from S, or marking/unmarking 
t; as a pin, we can update table Ti in 0(3* • fc '^^ • logn) time by following the path from r„ 
to the root and recomputing the tables in the traversed nodes. Also, Ti can be initialized in 
0(3* • k^^' • n) time by processing the tree decomposition in a bottom-up manner and applying 
the formula for every node. Note that updating/initializing table Ti must be performed after 
updating/initializing tables P and C. 

7.4.2 Query findSSeparator 

We now move to the next query, namely finding a balanced 5-separator. By Lemma 12.11 as 
G[U U S] has treewidth at most k, such a ^-balanced 5-separator of size at most k + 1 always 
exists. We therefore implement the following query. 



findSSeparator 

Output: A list of elements of a ^-balanced S-separator of G[U U S] of size at most k + 1, 

or _L if no such exists. 

Time: 0(tO(i)) 



Before we proceed to the implementation of the query, we show how to translate the problem 
of finding a S-balanced separator into a partitioning problem. 
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Lemma 7.2 (Lemma 12.91 restated). Let G be a graph and S C V{G). Then a set X is a 
balanced S -separator if and only if there exists a partition (Mi, M2, M3) of y{G) \ X, such that 
there is no edge between Mi and Mj for i 7^ j, and |Mj r\ S\ < \S\/2 for i = 1,2, 3. 



The following combinatorial observation is crucial in the proof of Lemma 12.91 

Lemma 7.3. Let 01,02, ■■■ ,ap be non-negative integers such that X^iLi ^i = Q ^'^'^ ^j — q/"^ /^'^ 
i = 1,2, ... ,p. Then there exists a partition of these integers into three sets, such that sum of 
integers in each set is at most q/2. 

Proof. Without loss of generality assume that p > 3, as otherwise the claim is trivial. We 
perform a greedy procedure as follows. At each time step of the procedure we have a number 
of sets, maintaining an invariant that each set is of size at most q/2. During the procedure we 
gradually merge the sets, i.e., we take two sets and replace them with their union. We begin 
with each integer in its own set. If we arrive at three sets, we end the procedure, thus achieving 
a feasible partition of the given integers. We therefore need to present how the merging step is 
performed. 

At each step we choose the two sets with smallest sums of elements and merge them (i.e., 
replace them by their union). As the number of sets is at least 4, the sum of elements of the 
two chosen ones constitute at most half of the total sum, so after merging them we obtain a set 
with sum at most q/2. Hence, unless the number of sets is at most 3, we can always apply this 
merging step. D 

Proof of Lemma \2. (A One of the implications is trivial: if there is a partition {Mi,M2,M^) 
oi G\X with the given properties, then every connected component ol G\ X must be fully 
contained either in Mi, M2, or M3, hence it contains at most \S\/2 vertices of S. We proceed 
to the second implication. 

Assume that X is a balanced S'-separator of G and let Ci, C2, . . . , Cp be connected compo- 
nents of G\X. For i = 1,2, ... ,p, let Op = IS n Cj|. By Lemma |7.3|, there exists a partition 
of integers Oi into three sets, such that the sum of elements of each set is at most 15*1/2. If we 
partition vertex sets of components Ci,G2, . . . ,Gp in the same manner, we obtain a partition 
(Ml, M2, M3) of V{G) \ X with postulated properties. D 

Lemma l2 . 91 shows that, when looking for a balanced S'-separator, instead of trying to bound 
the number of elements of S in each connected component of G[?7U5] \X separately, which could 
be problematic because of connectivity condition, we can just look for a partition of G[U U S] 
into four sets with prescribed properties that can be checked locally. This suggest the following 
definition of table T2. 

In table T2 we store entries for every node i of the tree decomposition, for every signature 
(j) = {Si, Ui) of Bi, and for every 8-tuple ip = (Mi, M2, M3, X, mi,m2, m3,x) where 

• (Ml, M2, M3, X) is a partition of SiUUi, 

• mi,m2,m^ are integers between and 15*1/2, 

• and X is an integer between and k + 1. 

This 8-tuple ip will be called the interface, and intuitively it encodes the interaction of a potential 
solution with the bag. Observe that the set U is not given in our graph directly but rather via 
connectivity information stored in table C , so we need to be prepared also for all the possible 
signatures of the bag; this is the reason why we introduce the interface on top of the signature. 
Note however, that the number of possible pairs (</>, -0) is at most 9' '' • k^"^', so for every bag 
Bi we store 9l^'l • k^^^> entries. 
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We proceed to the formal definition of what is stored in table T2- For a fixed signature (p = 
{Si, Ui) of Bi, let {Sf-\ C/f^*) be its extension, we say that partitioning (Mf=^*, M|^*, M|^*, X^^*) 
of S"?^* U f/P^* is an extension consistent with interface ip = (Mi, M2, M3,X, mi, 777-2,^3, x), if^ 

• X"""^ nBi = X and M|^* D Bi = Mj for j = 1, 2, 3; 

• there is no edge between vertices of M^^* and Mv"^* for j 7^ f; 

• iX^^t n VFil = x and IMJ'^* n Wi\ = m,- for j = 1, 2, 3. 
In entry T2 [i] [0] [V'] we store: 

• _L if (/) is invalid or no consistent extension of -0 exists; 

• otherwise, a list of length x of vertices of X'^^^ D Wi in some consistent extension of ip. 

The query findSSeparatorcan be realized in Oi^t'^^^') time by checking entries in the table 
T, namely T[r] [(0,0)] [(0,0, 0,0, mi, 777,2, "i-s, 2:)] for all possible values < ruj < \S\/2 and 
< X < A; + 1, and outputting the list contained in any of them that is not equal to _L, or _L if 
all of them are equal to _L. 

We now present how to compute entries of table T2 for every node i depending on the entries 
of children of i. We consider different cases, depending of the type of node i. For every case, 
we consider only signatures that are valid, as for the invalid ones we just put value _L. 

Case 1: Leaf node. If 7 is a leaf node then r2 [7] [(0,0)] [(0,0, 0,0, 0,0, 0, 0)] = 0, and all the 
other interfaces are assigned _L. 

Case 2: Introduce node. Let 7 be a node that introduces vertex v, and j be its only child. 
Consider some signature (p = {Si,Ui) of Bi and an interface ip = (Mi, M2, M3, X, 7771, 7712, 7713, x); 
we would like to compute ^2 [7] [(/>] [7/;] = Lj. Let (j)',^' be natural intersections of (pjip with 
Bj, respectively, that is, cp' = {Si D Bj, Ui D Bj) and V' = {Mi D Bj,M2 D Bj,M^ n Bj,X n 
Sj, 7/7,1,7/7,2, 7/7,3, x). Let T2[i][^'][V''] = -^j- We consider some sub-cases, depending on the 
alignment of tj in </> and ip. The cases with v belonging to Mi, M2 and M3 are symmetric, so 
we consider only the case for Mi. 

Case 2.1: v ^ X. Note that every extension consistent with interface ^ip is an extension 
consistent with ip' after trimming to Gj. On the other hand, every extension consistent with ip' 
can be extended to an extension consistent with ip by adding v to the extension of X. Hence, 
it follows that we can simply take Li = Lj. 

Case 2.2: v £ Mi. Similarly as in the previous case, every extension consistent with 
interface ip is an extension consistent with ip' after trimming to Gj. On the other hand, if we 
are given an extension consistent with ip' , we can add v to Mi and make an extension consistent 
with Ip if and only if v is not adjacent to any vertex of M2 or M3; this follows from the fact 
that Bj separates v from Wj, so the only vertices from M|^*, M^^^ that v could be possibly 
adjacent to, lie in Bj. However, if v is adjacent to a vertex of M2 or M3, we can obviously put 
Li = _L, as there is no extension consistent with ip: property that there is no edge between Mf^^ 
and M|^* U M^^^ is broken already in the bag. Otherwise, by the reasoning above we can put 
Li = Lj. 

Case 2.3: v £ Bi\ {Si U Ui). Again, in this case we have one-to-one correspondence of 
extensions consistent with ip with ip' after trimming to Bj, so we may simply put Lj = Lj. 

Case 3: Forget node. Let 7 be a node that forgets vertex w, and j be its only child. Consider 
some signature (p = {Si,Ui) of Bi, and some interface ip = {Mi, M2, M3, X,mi,m2,ms,x); we 
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would like to compute 72[i][0][V'] = L^. Let </>' = {Sj, Uj) be the only extension of signature (j) 
to Bj that has the same extension as (p; (j)' can be deduced by looking up which signatures are 
found valid in table C in the same manner as in the forget step for computation of table C. We 
consider three cases depending on alignment of w in (/)': 

Case 3.1: w ^ Sj L) Uj. If w is not in Sj U Uj, then it follows that we may put Lj = 
T2[j] [(/)']['(/'']: extensions of ip consistent with ip correspond one-to-one to extensions consistent 
with ip' . 

Case 3.2: w G Sj. Assume that there exist some extension (Mf^*,M2^*, Mg^*,^'^^*) 
consistent with ip. In this extension, vertex w is either in Mf^*, Mf^^, Mf^^, or in X^^^. Let us 
define the corresponding interfaces: 

• iIji = {MiU{w},M2,M3,X,mi - I,m2,m3,x); 
m ip2 = {Mi,M2U{w},M3,X,mi,m2 - 1, 1713, x); 

• ip3 = (Mi,M2,M3 U {■«;}, X, mi, m2, ma - l,x); 

• Vx = (Mi,M2,M3,XU{u;},mi,m2,m3,x-l). 

If any of integers rrii — l,m,2 — l,m3 — l,x — 1 turns out to be negative, we do not consider 
this interface. It follows that for at least one ip' £ {ipi,ip2,ip3,i^x} there must be an extension 
consistent with ip': it is just the extension (Mf ^* , Ml^* , M|^* , X'^^* ) . On the other hand, any 
extension consistent with any of interfaces ipi,ip2,i^3,ipx is also consistent with ip. Hence, we 
may simply put Lj = 72H[0'][V'']) and append w on the list in case ip' = ipx- 

Case 3.3: tf G Uj. We proceed in the same manner as in Case 3.2, with the exception that 
we do not decrement raj by 1 in interfaces V'j for j = 1, 2, 3. 

Case 4: Join node. Let i be a join node and ji,J2 be its two children. Consider some signature 
(p = {Si,Ui) of Bi, and an interface ip = {Mi,M2,M3,X,mi,m2,iTi3,x); we would like to 
compute T2 [i] [cp] [ip] = Li. Let (pi = {Si, Ui) be a signature of Bj-^ and (p2 = {Si, Ui) be a signature 
of Bj^. Assume that there is some extension {Mf^^, Mf^*, Mg^*, X'^^*) consistent with ip. Define 
m!q = \Wjp n Mg\ and x^ = {Wj^ n X\ for p = 1,2 and q = 1,2, 3; note that m^ + m^ = ruq for 
(7 = 1,2,3 and x^ + x^ = x. It follows that in Gj-^ , Gj^ there are some extensions consistent with 
(Ml, M2, M3, X ,m\,m\,m\,x^) and {Mi, M2, M3, X , mf, m^, m'^, x'^) , respectively — these are 
simply extension (M^*^*, M|^*,M|^*,X®^*) intersected with Vi,Vj, respectively. On the other 
hand, if we have some extensions in Gj-^, Gj^ consistent with {Mi,M2,M3,X,m\,m2,m3,x^) 
and (Ml, M2, M3, AT, m\,m^,m\,x'^) for numbers rrip, x^ such that rrig + rni = ruq for q = 1,2,3 
and x^ + x^ = x, then the point-wise union of these extensions is an extension consistent with 
(Mi,M2,M3, A, mi,m2 5'^3)a^)- It follows that in order to compute Li, we need to check if for 
any such choice of mp,xP we have non-± entries in T2[ji][(pi][{Mi,M2,M3,X,mi,m2,ml,x^)] 
and r2[J2][02][(Mi, M2, M3, A, m,^,m2, m-g, x^)]. This is the case, we put the union of the lists 
contained in these entries as Li, and otherwise we put _L. Note that computing the union of 
these lists takes 0{k) time as their lengths are bounded by k, and there is 0(/c^) possible choices 
of nip, x^ to check. 

Similarly as before, for every addition/removal of vertex v to/from 5 or marking/unmarking 
t; as a pin, we can update table T2 in 0(9* • k^^^ ■ logn) time by following the path from r^ 
to the root and recomputing the tables in the traversed nodes. Also, T2 can be initialized in 
0(9* • k^^^' • n) time by processing the tree decomposition in a bottom-up manner and applying 
the formula for every node. Note that updating/initializing table T2 must be performed after 
updating/initializing tables P and C. 
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7.4.3 Query findNextPin 

We now proceed to the next query. Recall that at each point, the algorithm maintains the set 
F of vertices marking components of G[U L) S]\ {X U S) that have been already processed. 
A component is marked as processed when one of its vertices is added to F. Hence, we need 
a query that finds the next component to process by returning any of its vertices. As in the 
linear-time approximation algorithm we need to process the components in decreasing order of 
sizes, the query in fact provides a vertex of the largest component. 



findNextPin 

Output: A pair {u,i), where (i) u is a vertex of a component of G[U L) S]\ {X U S) that 

does not contain a vertex from F and is of maximum size among such components, and (ii) 

i is the size of this component; or, _L if no such component exists. 

Time: 0(1) 



To implement the query we create a table similar to table C, but with entry indexing enriched 
by subsets of the bag corresponding to possible intersections with X and F. Formally, we store 
entries for every node i, and for every signature </> = {Si,Ui,Xi,Fi), which is a quadruple of 
subsets of Bi such that (i) SiCiUi = 0, (ii) Xi <^ SiL) Ui, (iii) Fi CUi\ Xi. The number of such 
signatures is equal to 6'^"-'. 

For a signature (j) = {Si,Ui, Xi, Fi), we say that {Sf"^^, C/f ""* , Xf ""* , Ff ""* ) is the extension of (j) 
if (i) (5f^*, t/f^*) is the extension of {Si, Ui) as in the table C, (ii) Xf * = XiU {Wi n X) and 
i?;cxt = Fj U {Wi n F). We may now state what is stored in entry T:>,[i][{Si, Ui,Xi, Fi)]: 

• if {Si, Ui) is invalid then we store _L; 

• otherwise we store: 

— an equivalence relation R between vertices of Ui\Xi, such that (fi,U2) G R if and 
only if vi, V2 are connected in G[C/f^* \ X"?^^]; 

— for every equivalence class K of R, an integer mx equal to the number of vertices 
of the connected component of G[Uf^^ \ Xf^^] containing K, which are contained in 
Wi, or to _L if this connected component contains a vertex of Ff^*; 

— a pair {u, m), where m is equal to the size of the largest component of G[U^^^ \ X'?^^] 
not containing any vertex of F?^^ or Ui, while u is any vertex of this component; if 
no such component exists, then {u,m) = (_L,_L). 

Clearly, query findNextPinmay be implemented by outputting the pair {u, m) stored in the 
entry T3[r][(0,0,0,0)], or _L if this pair is equal to (_L,_L). 

We now present how to compute entries of table T3 for every node i depending on the entries 
of children of i. We consider different cases, depending of the type of node i. For every case, 
we consider only signatures {Si,Ui,Xi,F,i) for which {Si,Ui) is valid, as for the invalid ones we 
just put value _L. 

Case 1: Leaf node. If i is a leaf node then Tg^ [(0,0, 0,0)] = (0, 0, (_L, _L)). 

Case 2: Introduce node. Let i be a node that introduces vertex v, and j be its only 
child. Consider some signature (/> = {Si,Ui,Xi,Fi) of Bi] we would like to compute T'3[i][(/)] = 
{Ri, {m]^)KeRi, (""ii iT^i))- Let cp' be a natural projection of (j) onto Bj, that is, (j)' = {SiCiBj, UiCi 
Bj,Xi n Bj,Fi n Bj). Let T3[j][(^'] = {Rj, {m^^)K<^Rj-, {uj,mj)); note that this entry we know, 
but entry T3[i][(/>] we would like to compute. We consider some sub-cases, depending on the 
alignment of u in (j). 



Case 2.1: v & Ui\ {Xi U Fi). If we introduce a vertex from Ui \ {Xi U Fi), then the extension 
of (j) is just the extension of cp' plus vertex v added to C/f^*. If we consider the equivalence classes 
of Ri, then these are equivalence classes of Rj but possibly some of them have been merged 
because of connections introduced by vertex v. As Bj separates v from Wj, v could only create 
connections between two vertices from Bj D {Uj \ Xj). Hence, we can obtain Ri from Rj by 
merging all the equivalence classes of vertices of Uj\Xj adjacent to v; the corresponding entry in 
sequence {mK)K<^Ri is equal to the sum of entries from the sequence {rrv'^)K<^Rj corresponding 
to the merged classes. If any of these entries is equal to _L, we put simply _L. If f was not 
adjacent to any vertex of Uj \Xj, we put t; in a new equivalence class K with rriK = 0. Clearly, 
we can also put {ui,mi) = {uj,mj). 

Case 2.2: v £ {Ui \ Xi) n Fi. We perform in the same manner as in Case 2.2, with 
the exception that the new entry in sequence {'mK)K£Ri will be always equal to _L, as the 
corresponding component contains a vertex from F?^^. 

Case 2.3: v £ SiL) Xi. In this case we can simply put T3[i][(/)] = T3[j] [(/>'] as the extensions 
of (p and (/)' are the same with the exception of v being included into Xf^^ and/or into S*?^*, 
which does not influence information to be stored in the entry. 

Case 2.4: v G Bi\ (Si U Ui). In this case we can simply put T3[i][(/)] = T3[j] [(/>'] as the 
extensions of (p and (p' are equal. 

Case 3: Forget node. Let i be a node that forgets vertex w, and j be its only child. Consider 
some signature (p = {Si, Ui,Xi, Fi) oi Bi; we would like to compute T'3[i][(/)] = {Ri, {m)^)K£Ri, {ui,mi)). 
Let {Sf^^,U^^^,Xf^^,F^^^) be extension of (p. Observe that there is exactly one signature 
(p' = {Sj,Uj,Xj,Fj) of Bj with the same extension as (p, and this signature is simply (p with 
w added possibly to Si, Ui, Xi or Fi, depending whether it belongs to Sf^^, Uf^^ , Xf^^, or 
^ext_ Coloring (p' may be defined similarly as in case of forget node for table C; we just need 
in addition to include w in Xf^^ or F?'^'^ if it belongs to X or F, respectively. 

Let T3[j][(/)] = {Rj, (r7T,]^)xe/?j) {'^ji^'^j))- As the extensions of and (p' are equal, it follows 
that we may take Ri equal to Rj with w possibly excluded from its equivalence class. Similarly, 
for every equivalence class K £ Ri we put m]^ equal to w,]^,, where K' is the corresponding 
equivalence class of Rj , except the class that contained w which should get the previous number 
incremented by 1, providing it was not equal to _L. We also put {ui,mi) = {uj,mj) except the 
situation, when we forget the last vertex of a component of G[C/?^* \ X^^*]: this is the case 
when w is in Uj \ Xj and constitutes a singleton equivalence class of Rj. Let then fn'r^ be the 
corresponding entry in sequence {m-'j^)KeRy If "T-j^i = -L, we simply put {ui,mi) = {uj,mj). 
Else, if {uj,mj) = (-L,-L) or m-^ry > rrij, we put {ui,mi) = {111,1711 -,), and otherwise we put 
{ui,mi) = {uj,mj). 

Case 4: Join node. Let i be a join node and j'l, j2 be its two children. Consider some signature 
(p = {Si,Ui,Xi,Fi) of Bi; we would like to compute T3[i][(^] = {R,i,{nt^)K(^Ri,{ui,mi)). Let 
01 = {Si,Ui,Xi, Fi) be a signature of Bj-^ and (p2 = {Si,Ui,Xi, Fi) be a signature of Bj^. Let 
^3[ii][<Ai] = {Rji,{mK)K€R,^,{un,mj^)) and T3[J2][cp2] = {Rj.„{m'^)K(iR,^,{uj^,mj^)). Note 
that equivalence relations Rj-^ and Rj^ are defined on the same set Ui \ Xj . It follows from the 
definition of T3 that we can put: 

• Ri to be the minimum transitive closure of Rj^ U Rj^ ; 

• for every equivalence class K oi Ri, rn^j^ equal to the sum of (i) numbers rrv'^ for Ki C K, 
Ki being an equivalence class of Rj^, and (ii) numbers m]^ for K2 ^ K, K2 being an 
equivalence class of Rj,^, if any of these numbers is equal to _L, we put m^ = _L; 
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• {ui,mi) to be equal to {uj^^rrij^) or {uj^,mj^), depending whether rrij^ or rrij^ is larger; if 
any of these numbers is equal to _L, we take the second one, and if both are equal to _L, 
we put {ui,mi) = (-L,-L). 

Similarly as before, for every addition/removal of vertex v to/from S, to/from X, to/from 
F, or marking/unmarking f as a pin, we can update table T^ in 0(6* • t'^^^' • logn) time by 
following the path from r„ to the root and recomputing the tables in the traversed nodes. Also, 
T3 can be initialized in 0{6^ -t^^' ■n) time by processing the tree decomposition in a bottom-up 
manner and applying the formula for every node. Note that updating/initializing table Ts must 
be performed after updating/initializing tables P and C. 

7.4.4 Query findUSeparator 

In this section we implement the last query, needed for the linear-time algorithm; the query is 
significantly more involved than the previous one. The query specification is as follows: 



findUSeparator 

Output: A list of elements of a |-balanced separator of G\U] of size at most A; -|- 1, or _L if 

no such exists. 

Time: 0(c* • A;'^(i) -logn) 



Note that Lemma l2.ll guarantees that in fact G\U] contains a g-balanced separator of size 
at most k -\-l. Unfortunately, we are not able to find a separator with such a good guarantee 
on the sizes of the sides; the difficulties are explained in Section [2j Instead, we again make use 
of the precomputed approximate tree decomposition to find a balanced separator with slightly 
worse guarantees on the sizes of the sides. 

In the following we will also use the notion of a balanced separation. For a graph G, we 
say that a partition {L,X,R) of V{G) is an a-balanced separation of G, if there is no edge 
between L and R, and \L\, \R\ < a\V{G)\. The order of a separation is the size of X. Clearly, if 
{L,X, R) is an a-balanced separation of G, then X is an a-balanced separator of G. By folklore 
[see the proof of Lemma [2. 2 j we know that every graph of treewidth at most k has a |-balanced 
separation of order at most k + 1. 

Expressing the search for a balanced separator as a maximization problem. Before 

we start explaining the query implementation, we begin with a few definitions that enable us 
to express finding a balanced separator as a simple maximization problem. 

Definition 7.4. Let G be a graph, and Tl, Tr be disjoint sets of terminals in G. We say that a 
partition {L,X,R) ofV{G) is a terminal separation of G of order £, if the following conditions 
are satisfied: 

(i) TlQL and Tr C R; 
(ii) there is no edge between L and R; 
(ih) \X\ < i. 

We moreover say that {L,X,R) is left-pushed {right-pushed) if \L\ {\R\) is maximum among 
possible terminal separations of order i. 

Pushed terminal separations are similar to important separators of Marx [33], and their 
number for fixed Tl^Tr can be exponential in L Pushed terminal separations are useful for us 
because of the following lemma, that enables us to express finding a small balanced separator 
as a maximization problem, providing that some separator of a reasonable size is given. 
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Lemma 7.5. Let G be a graph of treewidth at most k and let {Ai, B, A2) be some separation of 
G, such that \Ai\, \A2\ < fl^CG)!- Then there exists a partition {Tl,Xb,Tr) of B and integers 
ki,k2 withki + k2 + \XB\ <k + l, such that if Gi,G2 are G[AiU{B\Xb)] and G[A2LI{B\Xb)] 
with terminals Tl,Tb,, then 

(i) there exist a terminal separations ofGi,G2 of orders ki,k2, respectively; 

(a) for any left-pushed terminal separation {Li,Xi,Ri) of order ki in Gi and any right-pushed 
separation (L2, X2, R2) of order k2 in G2, the triple {Li[JTlUL2, XiUXbUX2, RiUTb,UR2) 
is a terminal separation of G of order at most fc + 1 with \Li UTiU L2I, |-Ri UTjiU R2\ < 
||y(G)| + ^^^^. 

Proof. As the treewidth of G is at most k, there is a separation (L, X, R) of G such that 
|-^M-R| ^ ll^lG*)! and \X\ < A; + 1 by folklore [see the proof of lemma [2.2] . Let us set 
{Tl,Xb,Tr) = {LnB,XnB,RnB), ki = |Xn^i| and /c2 = \XnA2\. Observe that X n Ai 
and X n A2 are terminal separations in Gi and G2 of orders ki and k2, respectively, hence we 
are done with (i). We proceed to the proof of (ii). 

Let us consider sets LCiAi, LnA2, RCiAi and RnA2. Since {Ai, B,A2) and {L,X,R) are 
J- and 3- balanced separations, respectively, we know that: 

• |L n ^il + |L n A2I + \B\ > l\V{G)\ -ik + 1); 

• \Rr\Ai\ + \Rr\A2\ + \B\ > l\V{G)\ - (k + l); 

• \LnAi\ + \RnAi\ + {k + l)>l\ViG)\-\B\; 

• \LnA2\ + \RnA2\ + {k + l)>l\ViG)\-\B\. 

We claim that either |Lnyli|, |i?n^2| > ||F(G)| - l^'+f +^\ or |Ln A2I, |i?n^i| > l\V{G)\- 
\B±p^. Assume first that |L n ^i| < l\V{G)\ - i^M^±l). Observe that then \L D ^2! > 
l\V{G)\ - \B\ -{k + l)- {l\V{G)\ - LM±i)) > l\V{G)\ - M±Mi) . Similarly, \RnAi\> 
l\V{G)\ - \B\ -{k + l)- {l\V{G)\ - l^l+(^+^) ) > i|l/(G)|-M±i^±i). The case when I i?n ^2 1 < 

||F(G)| — ' ' 2 is symmetric. Without loss of generality, by possibly flipping separation 

{L,X,R), assume that \Ln Ai\,\Rn A2\ > l\V{G)\ _ L5M^±1). 

Let {Li,Xi, Ri) be any left-pushed terminal separation of order ki in Gi and {L2,X2, R2) 
be any right-pushed terminal separation of order /c2 in G2. By the definition of being left- 
and right-pushed, we have that |Li n ^i| > \L n Ai\ > l\V{G)\ - '^'"^f^^^ and |i?2 n A2\ > 
\R n A2I > l\V{G)\ - M±Mi) . Therefore, we have that |Li U T^ U L2I < l\V{G)\ + M±i^±i) 
and|LiUTLUL2| <l\V{G)\+ l^l+(^'+^) . D 

The idea of the rest of the implementation is as follows. First, given an approximate tree 
decomposition of with 0{k) in the data structure, in logarithmic time we will find a bag Big 
that splits the component L'^ in a balanced way. This bag will be used as the separator B in the 
invocation of Lemma 17. 5| the right part of the separation will consist of vertices contained in 
the subtree below Big, while the whole rest of the tree will constitute the left part. Lemma 17.51 
ensures us that we may find some balanced separator of U by running two maximization dynamic 
programs: one in the subtree below Bi^ to identify a right-pushed separation, and one on the 
whole rest of the tree to find a left-pushed separation. As in all the other queries, we will store 
tables of these dynamic programs in the data structure, maintaining them with 0(c* log n) 
update times. 
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Case of a small U At the very beginning of the implementation of the query we read \U\, 
which is stored in the entry CardU[r][{(/},$)]. If it turns out that \U\ < 36(/c + t + 2) = 0{k), 
we perform the following explicit construction. We apply a DFS search from tt to identify the 
whole U; note that this search takes 0{k'^) time, as U and S are bounded linearly in k. Then 
we build subgraph G[C/], which again takes 0{k'^) time. As this subgraph has 0{k) vertices and 
treewidth at most k, we may find its 2 -balanced separator of order at most fc+1 in c time using 
a brute- force search through all the possible subsets of size at most k + 1. This separator may 
be returned as the result of the query. Hence, from now on we assume that \U\ > 36{k + t + 2). 

Tracing U We first aim to identify bag Bi^ in logarithmic time. The following lemma en- 
capsulates the goal of this subsection. Note that we are not only interested in the bag itself, 
but also in the intersection of the bag with of S and U (defined as the connected component 
oi G\ S containing vr). While intersection with S can be trivially computed given the bag, we 
will need to trace the intersection with U inside the computation. 

Lemma 7.6. There exists an algorithm that, given access to the data structure, in O^t'^^^'-logn) 
time finds a node io of the tree decomposition such that |f7|/4 < |Wj,j riU\ < \U\/2 together with 
two subsets Ui, Si of Bi^ such that Uq = U (1 B^^ and Sq = S (1 B^^ . 

Proof. The algorithm keeps track of a node i of the tree decomposition together with a pair of 
subsets (C/j, Si) = {BiCiU, BiOS) being the intersections of the bag associated to the current node 
with U and S, respectively. The algorithm starts with the root node r and two empty subsets, 
and iteratively traverses down the tree keeping an invariant that CardU[i][(Ui,Si)] > |C/|/2. 
Whenever we consider a join node i with two sons ji , J2 , we choose to go down to the node where 
CardU[^t][{Uj^,Uj^)] is larger among t = 1,2. In this manner, at each step CardU[i][{Ui, Si)] 
can be decreased by at most 1 in case of a forget node, or can be at most halved in case 
of a join node. As \U\ > 36{k + t + 2), it follows that the first node io when the invariant 
CardU\i][{Ui,Si)] > \U\/2 ceases to hold, satisfies \U\/4 < CardU[io][{Ui^-,,Si^-,)] < \U\/2, and 
therefore can be safely returned by the algorithm. 

It remains to argue how sets (C/j, Si) can be updated at each step of the traverse down the 
tree. Updating Si is trivial as we store an explicit table remembering for each vertex whether 
it belongs to S. Therefore, now we focus on updating U. 

The cases of introduce and join nodes are trivial. If i is an introduce node with son j, then 
clearly Uj = UiD Bj. Similarly, if i is a join node with sons ji,J2, then Uj^ = Uj^ = Ui. We are 
left with the forget node. 

Let i be a forget node with son j, and let Bj = BiL) {w}. We have that Uj = t/j U {w} or 
Uj = Ui, depending whether w £ U or not. This information can be read from the table C[j] 
as follows: 

• if C[j][{Ui U {w}, Sj)] = ±, then w (^ U and Uj = Ui; 

• if C[j][{Ui, Sj)] = ±, then w e U and Uj = UiU {w}; 

• otherwise, both C[j][{Ui, Sj)] and C[j][{UiU{w}, Sj)] are not equal to _L; this follows from 
the fact that at least one of them, corresponding to the correct choice whether w £ U or 
w ^ U, must be not equal to _L. Observe that in this case w is in a singleton equivalence 
class of C[j][{UiU{w} , Sj)], and the connected component ofw in the extension o{UiU{w} 
cannot contain the pin tt. It follows that w ^ U and we take Uj = Ui. 

Computation at each step of the tree traversal takes 0(t^^^') time. As the tree has loga- 
rithmic depth, the whole algorithm runs in 0{t^^' • logn) time. D 
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Dynamic programming for pushed separators In this subsection we show how to con- 
struct dynamic programming tables for finding pushed separators. The implementation resem- 
bles that of table r2, used for balanced S'-separators. 

In table T4 we store entries for every node i of the tree decomposition, for every signature 
(f) = (Si, Ui) of Bi, and for every 4-tuple ip = (L, X, R, x), called again the interface, where 

• (L, X, R) is a partition of Ui, 

• X is an integer between and k + 1. 

Again, the intuition is that the interface encodes the interaction of a potential solution with the 
bag. Note that for every bag Bi we store at most 5'^"-' ■ {k + 2) entries. 

We proceed to the formal definition of what is stored in table T4. Let us fix a signature 
(j) = {Si,Ui) of Bi, and let {Sf^^,Uf^^) be its extension. For an interface ip = {L,X,R,x), we 
say that a terminal separation (L'^^*,X'^^*, i?'^^*) in G\Uf^^] with terminals L,R is an extension 
consistent with interface tp = (L, X, R, x) if 

• L'^^* r\Bi = L, X^^* nBi = X and i?^^* r\B, = R; 

• |x^^*nVFi| =x. 

Then entry r4 [i] [</>] [7/>] contains the pair (r, Xq) where r is the maximum possible |i?*^^* n Wi\ 
among extensions consistent with ^ip, and Xq is the corresponding set X^^'^ f^ Wi for which this 
maximum was attained, or _L if the signature (j) is invalid or no consistent extension exists. 

We now present how to compute entries of table T4 for every node i depending on the entries 
of children of i. We consider different cases, depending of the type of node i. For every case, 
we consider only signatures that are valid, as for the invalid ones we just put value _L. 

Case 1: Leaf node. If i is a leaf node then r4[i][(0,0)][(0,0, 0, 0)] = (0,0), and all the other 
entries are assigned _L. 

Case 2: Introduce node. Let i be a node that introduces vertex v, and j be its only child. 
Consider some signature (j) = {Si,Ui) of Bi and an interface V' = {L,X,R,x); we would like to 
compute X4 [i] [0] [■(/'] = (rj,XQ). Let (p' ,11)' be natural intersections of <t),ip with Bj, respectively, 
that is, 0' = (S'i n Bj , Ui n Bj ) and ip' = {LnBj,X nBj,RnBj,x). Let T4 [j] [cp'] [ip'] = (rj , X^' ) . 
We consider some sub-cases, depending on the alignment of f in and ip. The cases with v 
belonging to L and R are symmetric, so we consider only the case for L. 

Case 2.1: v £ X. Note that every extension consistent with interface ip is an extension 
consistent with ip' after trimming to Gj. On the other hand, every extension consistent with ip' 
can be extended to an extension consistent with ip by adding v to the extension of X. Hence, 
it follows that we can simply take (rj,Xg) = {vjjXq). 

Case 2.2: v £ L. Similarly as in the previous case, every extension consistent with interface 
Ip is an extension consistent with ip' after trimming to Gj. On the other hand, if we are given 
an extension consistent with ip' , then we can add v to L and make an extension consistent 
with Ip if and only if v is not adjacent to any vertex of R; this follows from the fact that Bj 
separates v from Wj, so the only vertices from i?*^^* that v could be possibly adjacent to, lie in 
Bj. However, if v is adjacent to a vertex of R, then we can obviously put (rj, Xq) = _L as there 
is no extension consistent with ip: property that there is no edge between L and R is broken 
already in the bag. Otherwise, by the reasoning above we can put (rj,XQ) = (r^jXp). 

Case 2.3: v £ Bi\ Ui. Again, in this case we have one-to-one correspondence of extensions 
consistent with ip and extensions consistent with ip' , so we may simply put (rj, ATq) = (vjjXq). 
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Case 3: Forget node. Let i be a node that forgets vertex w, and j be its only child. Consider 
some signature (j) = (St, Ui) of Bi, and some interface V' = {L-, X, R, x); we would like to compute 
T4 [i] [(/)] [7/;] = {ri,X^). Let (j)' = {Sj,Uj) be the only the extension of signature (j) to Bj that has 
the same extension as </>; (j)' can be deduced by looking up which signatures are found valid in 
table C in the same manner as in the forget step for computation of table C. We consider two 
cases depending on alignment of w in (p': 

Case 3.1: w ^ Uj. If w is not in Uj, then it follows that we may put (rijX^) = T4[j][(p'][tp']: 
extensions consistent with V' correspond one-to-one to extensions consistent with ip' . 

Case 3.2: w € Uj. Assume that there exists some extension {L^^^ , X^^^ , R^^^) consistent 
with ip, and assume further that this extension is the one that maximizes \R^^^ n Wi\. In this 
extension, vertex w is either in L°^*, X'^^^, or in i?°^*. Let us define the corresponding interfaces: 

• V'L = {LU{w},X,R,xy, 

• iJx = iL,XU{w},R,x-iy, 

• ijR = iL,X,R\j{w},x). 

If X — 1 turns out to be negative, we do not consider -i/^x- For t G {L,X,R}, let {rjjX^ ) = 
^4[i][0'][V't]- It follows that for at least one ip' £ {ipL,ipx,'4'R} there must be an extension 
consistent with ip': it is just the extension (L^^^ , X'^^^ , R"^^^). On the other hand, any extension 
consistent with any of interfaces V'L) V'Xj ^i? is also consistent with ^p. Hence, we may simply put 
Vi = uiax{rL,rx,rji + 1), and define Xq as the corresponding X^ , with possibly w appended if 
t = X. Of course, in this maximum we do not consider the interfaces ipt for which T^lj] [(p'] [ipt\ = 
_L, and if Ti[j\[(t)'][%ljt] = -L for ah t G {L,X,R], we put (ri,X^) = _L. 

Case 4: Join node. Let i be a join node and Ji,i2 be its two children. Consider some 
signature (p = {Si,Ui) of Bi, and an interface ^p = {L,X,R,x); we would like to compute 
r4[i][^]['0] = {ri,XQ). Let cpi = {Si,Ui) be a signature of Bj-^ and (p2 = {Si,Ui) be a signature 
of Bj^. Assume that there is some extension (L*^^^ , X°^^ , R'^^^) consistent with ip, and assume 
further that this extension is the one that maximizes \R^^^ n WA. Define r^ = \Wi„ n R\ and 



xP = \Wjp n X\ for p = 1,2; note that r^ -\- r"^ = n and x^ + x"^ = x. It follows that in Gj^, 
Gj2 there are some extensions consistent with {L,X, R,x^) and (L,X, i?, x^), respectively — 
these are simply extension {L^^^ , X*^^^ , R'^^^) intersected with Vi,Vj, respectively. On the other 
hand, if we have some extensions in Gj^, Gj^ consistent with {L,X,R,x^) and {L,X, R,x'^) for 
numbers x^ such that x^ + x^ = x, then the point-wise union of these extensions is an extension 
consistent with {L,X,R,x). It follows that in order to compute (rj,XQ), we need to iterate 
through choices of x^ such that we have non-_L entries in T2[ji][0i][(L,X, i?, x^)] = {r^ jX^-'"^ ) 
and T2[j2][02][(-^)^5 ^)2^^)] = {^^ jXq^''^ ), choose X"'^,x^ for which r^ + r^ is maximum, and 

1 o ■ 1 '2 

define (rj,Xg) = (rj^ -|- r|^ ,Xq-'^ U Xq^'^ ). Of course, if for no choice of x^,x^ it is possible, 

we put (rj, Xq) = _L. Note that computing the union of the sets Xq^'^ for p = 1,2 takes 0{k) 
time as their sizes are bounded by k, and there is 0{t) possible choices of x^ to check. 

Similarly as before, for every addition/removal of vertex v to/from S or marking/unmarking 
v as a pin, we can update table T4 in 0(5* • A; '^' • logn) time by following the path from r^ 
to the root and recomputing the tables in the traversed nodes. Also, T4 can be initialized in 
0(5* • k^^^' ■ n) time by processing the tree decomposition in a bottom-up manner and applying 
the formula for every node. Note that updating/initializing table T4 must be performed after 
updating/initializing tables P and G. 
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Implementing query findUSeparator We now show how to combine Lemmata 17.51 and 17.61 
with the construction of table T4 to implement query findUSeparator. 

The algorithm performs as follows. First, using Lemma 17.61 we identify a node io of the 
tree decomposition, together with disjoint subsets (C/jg, S'jq) = {U Ci Bi^^ S n Bi^^) of -Bjo, such 
that \U\/A < \Wig r\U\ < \U\/2. Let A2 = Wi^ and Ai = V{G) \ Fi,,. Consider separation 
(^1 n [/, Sio n [/, ^2 n U) of G[U] and apply Lemma 1731 to it. Let (T^, X%,T'^) be the partition 
of -Bjg and k^, k2 be the integers with ki+k2 + \X^\ < k + 1, whose existence is guaranteed by 
Lemma 17.51 

The algorithm now iterates through all possible partitions {Tl^Xb-,Tr) of fij,, and integers 
ki, k2 with fci + /c2 + \Xb\ < k + 1. We can clearly discard the partitions where there is an edge 
between T^ and Tr. For a partition {Tl^XbiTr), let Gi,G2 be defined as in Lemma 1731 for 
the graph G[U]. For a considered tuple (Tl, Xb ,Tii, ki, k2) , we try to find: 

(i) a separator of a right-pushed separation of order k2 in G2, and the corresponding cardi- 
nality of the right side; 

(ii) a separator of a left-pushed separation of order ki in Gi , and the corresponding cardinality 
of the left side. 

Goal (i) can be achieved simply by reading entries T'4[io][(C/io, 5i(,)][(Tj,, X5, Tr, k')] for k' < k2, 
and taking the right-pushed separation with the largest right side. We are going to present how 
goal (ii) is achieved in the following paragraphs, but firstly let us show that achieving both of 
the goals is sufficient to answer the query. 

Observe that if for some {Tl,Xb,Tji) and (^1,^2) we obtained both of the separators, 
denote them Xi,X2, together with cardinalities of the corresponding sides, then using these 
cardinalities and precomputed \U\ we may check whether X1LIX2UXB gives us a |-separation of 
G[[/]. On the other hand, Lemma[73] asserts that when (T^, X^,T^) and {k^, ^2) are considered, 
we will find some pushed separations, and moreover any such two separations will yield a |- 
separation of G[C/]. Note that this is indeed the case as the sides of the obtained separation have 
cardinalities at most ||C/| + (fe+^)+(*+i) = ||c/| + tk^ - ^ < ||f/|, since \U\ > 36(yt + t + 2). 

We are left with implementing goal (ii). Let G'^ be Gi with terminal sets swapped; clearly, 
left-pushed separations in Gi correspond to right-pushed separations in G'^. We implement 
finding a right-pushed separations in G\ as follows. 

Let P = {io,ii, . . . ,ih = r) be the path from iq to the root r of the tree decomposition. The 
algorithm traverses the path P, computing tables D[it\ for consecutive indexes t = 1, 2, . . . , t. 
The table D[it\ is indexed by signatures (j) and interfaces ip in the same manner as T4. Formally, 
for a fixed signature (p = (5'it, C/jJ oi Bi^ with extension {Sf^^, Uf^^), we say that this signature is 
valid with respect to {Si^, Ui^) if it is valid and moreover {Si^, Ui^) = {Sf^^nBi^, U^^^nBig). For 
an interface ijj we say that separation {L^^^ , X°^^ , R^^^) in G[Uf^^ \ VFjg] with terminals L,R is 
consistent with ip with respect to {TL,XB,Tfi), if it is consistent in the same sense as in table T4, 
and moreover (T^ , Xb , Tr) = (L*^^* n Bi^ , X^^* n Bi^ , i?^^* nBiJ. Then entry T[it] [(/)] [V'] contains 
the pair (r, Xq) where r is the maximum possible |i?''^*nTyj| among extensions consistent with ■0 
with respect to {Tl,Xb,Tr), and Xq is the corresponding set X'^^^CiWi for which this maximum 
was attained, or _L if the signature (p is invalid with respect to {Si^, Ui^) or no such consistent 
extension exists. 

The tables D\it\ can be computed by traversing the path P using the same recurrential 
formulas as for table T4. When computing the next -D[it], we use table Z)[ii_i] computed in the 
previous step and possible table T^ from the second child of it- Moreover, as D[io] we insert the 
dummy table Dummy [(j)\[tlj] defined as follows: 

. Dummy[{U,,,Si,)][{TR,XB,TLM=^\ 
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• all the other entries are evaluated to _L. 

It is easy to observe that table Dummy exactly satisfies the definition of D[iQ]. It is also 
straightforward to check that the recurrential formulas used for computing T4 can be used in 
the same manner to compute tables D[it] for t = 1,2, . . . ,h. The definition of D and the method 
of constructing it show, that the values Z)[r][(0, 0)][(0, 0, 0, x)] for x = 0,1, . . . ,k, correspond to 
exactly right-pushed separations with separators of size exactly x in the graph G'l : insertion of 
the dummy table removes A2 from the graph and forces the separation to respect the terminals 
in Big . 

Let us conclude with a summary of the running time of the query. Algorithm of Lemma 17.61 
uses 0{t^^^' ■ logn) time. Then we iterate through at most 0(3* • k"^) tuples {Tl,Xb,Tr) and 
(/ci, ^2), and for each of them we spend 0{k) time on achieving goal (i) and 0(5* • A;'^*--'^^ • logn) 
time on achieving goal (ii). Hence, in total the running time is 0(15* • A; '^^ • logn). 
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